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! 1: AF068748. Mus musculus sphi...[gi:365969I] 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
MEDLINE 
PUBMED 
REFERENCE 
AUTHORS 

TITLE 
JOURNAL 

FEATURES 

source 



AF068748 1815 bp mRNA linear ROD 29-S 

Mus musculus sphingosine kinase (SPHKla) mRNA, partial cds. 
AF068748 

AF068748.1 GI:3659691 



EP-1998 



Mus musculus 

Mus musculus 



g ene 

CDS 



; Metazoa; Chordata; 

Eutheria; Rodent ia; 

1 to 1815) 
, Olivera,A., Edsall,L 



Craniata; Vertebrata; 
Sciurognathi ; Muridae j 



Euteleostomi ; 
Murinae ; Mus . 



Nagiec,M.M., Dickson, R. and 



Eukaryota 
Mammal ia ; 

1 (bases 
Kohama , T . 
Spiegel , S . 

Molecular cloning and functional characterization of murine 
sphingosine kinase 

J. Biol. Chem. 273 (37), 23722 -23728 (1998) 
98395082 
9726979 

2 (bases 1 to 1815) 

Kohama,T., Olivera,A., Edsall,L., Nagiec,M.M., Dickson, R. and 

Spiegel , S . 

Di rect Submission 

Submitted (26 -MAY-1998) Biochemistry, Georgetown University, 3900 
Reservoir Rd, Washington, DC 20007, USA 

Location/Qualifiers 

1 . . 1815 

/organism="Mus musculus" 
/db_xref="taxon: 10090" 

day embryo" 



"8.5 



382 a 



/dev_stage= 
<1..1815 
/gene="SPHKla" 
<1. . 1516 
/gene= n SPHKla" 
/codon_start=2 

/product =" sphingosine kinase" 
/protein_id=" AAC61697. 1 " 
/db_xref ="01:365 96 92" 

/translat ion= " ALLGTPKILRNSVGELQGPLGEKPPSRRDLVHCNPFLSGFVFLL 
DSPLLDFKKRCEDRDLAAPGNDEGAHTAQGGGEGEPHGQSPDTLLGNTDKKLNAGAAV 
TSSSAGAAPVAPCQREPRDLAMEPECPRGLLPRPCRVLVLLNPQGGKGKALQLFQSRV 
QPFLEEAEITFKLILTERKNHARELVCAEELGHWDALAVMSGDGLMHEWNGLMERPD 
WETAIQKPLCSLPGGSGNALAASVNHYAGYEQVTNEDLLINCTLLLCRRRLSPMNLLS 
LHTASGLRLYSVLSLSWGFVADVDLESEKYRRLGEIRFTVGTFFRLASLRIYQGQLAY 
LPVGTVASKRPASTLVQKGPVDTHLVPLEEPVPSHWTWPEQDFLLVLVLLHTHLSSE 
LFAAPMGRCEAGVMHLFYVRAGVSRAALLRLFLAMQKGKHMELDCPYLVHVPWAFRL 
EPRSQRGVFSVDGELMVCEAVQGQVHPNYLWMVCGSRDAPSGRDSRRGPPPEEP" 
511 c 533 g 389 t 



BASE COUNT 
ORIGIN 

1 ggccctcctg gggactccta aaatactgag aaactcggtc ggggaactcc agggtcctct 
6,1 gggagagaag ccaccttcaa ggcgtgacct agttcactgc aatcctttct tatctgggtt 
121 cgttttcctc ttggactcgc ctcttctgga ctttaagaag cgatgcgaag atagagatct 
181 ggccgccccg gggaatgacg agggcgctca cacagcccag ggaggtggag agggcgagcc 
241 ccacggccag tcgccagaca ccctcctggg caacaccgat aagaagctga acgcaggagc 
301 cgccgttacc tctagcagcg ccggggcagc accggtggcc ccttgtcagc gggagccccg 
361 ggacctggct tatggaaccag aatgccctcg aggactgctc ccacggccat gcagagtgct 
421 ggtgctgctg aacccccagg gtggcaaggg caaggctctg cagctcttcc agagccgtgt 
4 81 gcagcccttc ctggaggagg cagagataac ctttaaactg atactcaccg aacggfaagaa 
541 ccatgccagg gagctggtgt gtgcagagga gttgggtcac tgggacgccc tggca^gtcat 
601 gtccggtgat ggtctgatgc atgaagtggt gaatgggcta atggaacggc cagactggga 
661 gactgccatc cagaaacccc tgtgtagcct ccctggaggc tccggcaatg cgctggcagc 
721 ttctgtgaac cactatgctg ggtacgagca ggtgactaat gaagacctgc tcatcaactg 
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781 cacactgctg ttgtgccgcc ggcgcctgtc acccatgaac ctgctgtccc tgcacactgc 

84 1 ttctgggctg cggctctatt ctgtgctcag tctgtcctgg ggctttgttg ctgacgtgga 

901 cctcgagagt gagaagtaca ggcgcttggg ggagattcgt ttcacagtgg gcaccttctt 

961 tcgcctagca agcctgcgca tctaccaagg ccaactggcc taccttcctg taggaactgt 

1021 ggcctctaag agacccgcct ctacactggt gcagaagggc cccgtcgaca cacaccttgt 

1081 tcctctggag gagccagtgc cttctcattg gactgtggta ccagaacagg acttcctcct 

1141 ggtgctggtg ctgctacaca cccacctgag ctccgagctg tttgcagcac ccatgggccg 

1201 ctgtgaagct ggtgttatgc atctgttcta cgtacgtgcg ggggtgtcaa gggctgcgct 

1261 gctgcgcctc ttcctggcca tgcagaaggg caagcatatg gaacttgact gtccatacct 

1321 ggtccatgtg cccgtggttg ctttccgcct ggagcccagg agccagaggg gcgtgttttc 

1381 tgtggatgga gagctgatgg tatgtgaagc tgtgcagggc caagtgcacc caaactacct 

1441 ttggatggtc tgtggcagca gagatgcccc atccggccgg gactcccggc gggggccacc 

1501 tccagaagaa ccataactct gtgcctttgt ctactctgtc taggctgaga tgggaccctc 

1561 ccccgcaccc acttcctggt atgggaggtt atttctaaag ttcctatgga agtggtgggg 

1621 acccctgcag agaaagctag aaggtggggc tatgacttgg aaagaaaggc tctaccttcc 

1681 agttagagta acatccccag tagagccctg ctggctggac cagttgcata tagaagacac 

1741 tccccattgc tcttagggac cttccctggg aaccaaattc aaataaagag acttttccaa 

1801 aaaaaaaaaa aaaaa 
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□ 1: BI946369. 01112 leafy spurg...[gi: 16284977] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name : 
GenBank Acc : 
GenBank gi : 

CLONE INFO 

Clone Id: 
DNA type: 

PRIMERS 

Sequencing : 
PolyA Tail : 



9916317 

01112 

BI946369 

16284977 



3 IAN (5") 
CDNA 



pAD5 
Unknown 



SEQUENCE 



GGCACGAGGCAAACTTCCAGCTCCCAAATGGATCAATCCGATACCTCGCCGACGACGGCC 
ATAATATCGGACAATGTATCGGTCAACGGCACCGTCACGCTTTTGACTTTGACCTCAGAC 
GGCGTGCTCCGGTGGACTGAAAGACSCCAACGCTGTTTGATTCTCGARAAACAGGTMCTT 
GG ATTTGT AATGG AAGG AG AT AAACTAARAATCAAG A CTTT AG TTG AT AACGGTGATGG A 
GTCTGCTGTGGAGGAAGTACCGGASCTTTGGTCAGGAAGGATTTCATATTCCASCCTTTG 
ACTCAARAATCGGGGAGACTTTGGTGCCAGCATCTCCGTGATTATCTCAATTCTCTCGGY 
CG T C C C AGG AGG CTG TTTG T ATT CG TG AATCC ATTTGG AGG AAAG AAG TCTG CTT C AAAA 
GTTTTTCTTGATGTTGTAAAACCATTACTGGAARATGCTGATGTCCAATTCACATTGCAA 
GAAACTAAACGTCAGCTGCATGCCAAGGAAGTTGCTAAGGTTCTGGATCTAAGTAAATAT 
GATGGTATAGT 

Encry Creaced: Oct 19 2001 
Last Updated: Oct 19 2001 

PUTATIVE ID Assigned by submitter 

PIR:T05162 T05162 hypothetical protein F18E5.160 - 
Arabidopsis thaliana EMB:065419 065419 HYPOTHETICAL 138.7 
KDA PROTEIN. 



LIBRARY 

Lib Name: 

Organism : 
Tissue type: 
Develop, stage: 



leafy spurge Lambda HybriZAP 2.1 two-h ybrid vector cDNA 
Library 

Euphorbia esula 
underground adventitious buds 
3 -day induced (decapitated) 



SUBMITTER 
Name : 
Lab: 

Institution: 
Address : 
Tel: 
Fax : 
E-mail : 

CITATIONS 

Title : 

Authors : 
Year: 
Status : 



ND 58105, USA 



Anderson JV 

Plants Science Research 

USDA/ARS, Biosciences Research Lab 

1605 Albrecht Blvd., PO Box 5674, Fargo, 

701 239 1263 

701 239 1252 

ander s j v@f ar go . ars . u s da . g ov 



Identification of mRNAs expressed in underground 
adventitious buds of Euphorbia esula (leafy spurge) 
Anderson, J . V . , Horvath.D.P. 
2000 

Unpublished 
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r 1: AI237625. EST234187 Normali...[gi:3831 131] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name : 
GenBank Acc : 
GenBank gi: 

CLONE INFO 

Clone Id: 
Source : 
Id in host : 
DNA type : 

PRIMERS 

Sequencing : 
PolyA Tail : 

SEQUENCE 



Entry Created: 
Last Updated: 

LIBRARY 

Lib Name: 
Organism: 
Organ: 
Vector : 
R. Site 1: 
R. Site 2: 



2010430 

EST234187 

AI237625 

3831131 



RPLDB60 (3' end) 

ATCC 

2044267 

cDNA 



M13 -21 
Unknown 



GTGGATATGTCAACATAATTGGTCAGTCGAAATTTTCTTAAGTAATATTCACACTTCCCA 
AAGCTCCTGGGCTTTGGGATTCTTCCTTAGGACTCTTCTTCGATCCCCCGAGCAAAGAGG 
CGCACCAGCTGGCAGTGGACCCTGACCTCAATGGCAGGGCTGTGCAGGACTTCCCCATCA 
CAGTTCCAGGAGCTTCTAGAGGTTGGGCAGGCGCAAGGGGGGTTGTCCTTGCAGATCTGC 
CCAAACTTCTGCTTCTCTAGTTCCTTCAAGTCATTGTCATCATCTTCCACGTGCTTTGAT 
GTGAACTGGAATTTCTTGACTCGATAAACTTCAACGAAAGTGAAGCCAAACTGATCCTCC 
TGGTTGGTGTGCCGGATGAGGAATCTCAGGAAGTTGAACCTGGAGCATTTCCGGATAAGG 
ATGAGGTCAGAAGACCCATCTCCCAGATGGGCAAATGGGGACAGGCCCCCAGGGCTCCGG 
GGACAAGCACAGGACATGTTGGTGGAGTTGATAGCCAGGAAC 

Nov 2 1998 
Jan 31 1999 



Normalized rat placenta, 
Rattus s p. 
placenta 
pT7T3Pac 
EcoRI 
NotI 



Bento Soares 



SUBMITTER 

Name : 

Inst itut ion : 
Address : 
Tel : 
Fax : 
E-mail : 

CITATIONS 

Title : 

Authors : 

Year: 
Status : 



Lee, NH 

The Institute for Genomic Research 

9712, Medical Center Drive, Rockville, MD 20850, USA 
(301) -838-3529 
(301) -838-0208 
nhleegtig r . org 



Rat Genome Project: Generation of a Rat EST (REST) Catalog & 
Rat Gene Index 

Lee.N.H., Glodek,A., Chandra, I., Mason, T.M., Quackenbush, J . , 

Kerlavage , A . R . , Adams , M . D . 

1998 

Unpublished 
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C 1: NM_059576. Caenorhabditis el...[gi: 17509012] 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



gene 



CDS 



T10B11.2 1650 bp mRNA linear INV 03-D EC-2001 

Caenorhabditis elegans T10B11.2.p (T10B11.2), mRNA. 

NM_059576 

NM_059576.1 GI: 17509012 

Caenorhabditis elegans 
Caenorhabditis elegans 

Eukaryota; Metazoa; Nematoda; Chromadorea; Rhabditida; 
Rhabditoidea ; Rhabditidae; Peloderinae; Caenorhabditis. 
1 (bases 1 to 1650) 

Annotated by the C. elegans Genome Sequencing Consortium. 
Direct Submission 

Submitted (03 -DEC-2001) National Center for Biotechnology 
Information, NIH, Bethesda, MD 20894, USA 

Location/Qualifiers 

1. . 1650 

/organism= " Caenorhabditis elegans " 
/db_xref = " t axon : 6 23 9 " 
1. .1650 

/gene="T10B11.2" 
/db_xref="WormBase : T1 0B11 . 2 " 
1 . . 1650 

/gene="T10Bll .2" 
/codon_start=l 
/product = "T10B1 1 . 2 .p" 
/protein_id=" NP_ 491977 . 1 " 
/db_xref =»GI : 17509013" 
/db_xref="WormBase: T10B11 .2 " 

/translat ion= "MPNSKKSKKGGDQQHVTIVPVEPVKGENVDTVAYSSRSRISGES 
GHLIADVQSPHAKKHRI IFDRDHNVFEFRLLDGSAKHI IVYRLDELLSTTCYPFKIKN 
GVPIIPTKPTTSDKTLYFNFVYKKDKQKWRLKQIPVIFYTTSERDYWHSLIDTTLRRV 
KNRPKNI 1 1 FINPFGGNGKAQKIFKDNVDAFFWLTPGLRYKWLTERANHARDYI VEM 
PPEQWSAIDGLVSVGGDGLFNELLSGALLRTQTDAGRNIDNPSSHLVTPHIRFGIIGA 
GSANSIVSTVHETNDHATSAVHIAIGSECNVDVCTVHQHQKLIRISANAISYGWLGDV 
LRDSEEYRCLGPIRYQWSALRTTIRHPIYRGMVQFSLSHKENVNPKDQLPPCLEPCPV 
CMKPQGNDKYDYHWHAEFTHVICCVIPTVTPFTPYGLAPFTGIGDGTLDLALVPRISR 
FHNMQFMRKVAMYGGKQLYELDPSLNCYRVTKWSYQPDADQEDPGVWNLDGEILEQPK 
DEPLHFKLHPQLISFFGRDAAMVKPTKRSFIKKRKSSIVYQ" 
317 c 326 g 483 t 



524 a 



BASE COUNT 
ORIGIN 

1 atgccaaatt cgaaaaaaag caaaaaagga ggagatcaac aacatgtcac aattgttcct 

61 gttgaaccag ttaaaggtga aaacgtggac accgttgcat attcatcgag aagccgaata 

121 agtggagaaa gtggtcactt aattgctgat gttcaatctc cacatgcaaa aaagcatcga 

181 attatatttg atagagatca taatgtattc gagttcagat tacttgatgg ttcggcaaag 

241 cacataattg tatatcgtct tgatgagctt ctctccacaa cttgctatcc atttaaaatc 

301 aaaaatggag ttccgatcat tccgacgaaa ccaactacat ctgacaagac actttatttc 

361 aattttgtgt ataagaagga taaacaaaaa tggagactca aacagattcc agtcattttc 

421 tatacgacta gtgaaagaga ttattggcat tcactgattg atactacatt gagaagagtc 

481 aaaaaccgtc cgaagaacat tattatcttt atcaatccgt ttggtggaaa tggaaaagca 

54 1 caaaaaattt tcaaggataa tgttgatgca ttcttctggt tgactcctgg attacggtat 

601 aaagttgttc tcactgaaag agccaatcat gctcgtgact atattgtcga aatgcctccc 

661 gagcagtgga gcgctattga tggattggta tctgttggtg gagatggatt gtttaatgaa 

721 cttctctctg gagctcttct tcgtactcaa accgatgctg gccgtaatat tgacaatcca 

781 tcatctcatt tggtaactcc acatataaga tttggtatta ttggagctgg ttctgccaat 

84 1 tcaattgtgt caactgttca tgaaacgaat gatcatgcca catcagctgt tcatattgca 

901 attggatctg aatgcaacgt ggatgtatgt acagttcatc agcatcaaaa gcttattcga 

961 atttcggcaa atgcaatttc gtatggatgg ttgggagatg ttcttagaga ttctgaagaa 

1021 tatagatgtc tcggaccaat tagatatcag tggagtgctc ttcgaacaac aattcgtcat 

1081 ccaatttatc gtggaatggt acaattctct ttatcacata aagaaaatgt taatccaaag 

1141 gatcaacttc caccgtgtct agagccatgc ccagtttgta tgaaacctca aggaaatgat 
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1201 aaacacgact accaccggca tgctgaattt actcacgtca tttgctgtgt tattccaaca 

1261 gctactccat ttactccata tggacttgct ccttttacag gaattggtga cggaacactt 

1321 gacttggcac tggttcccag aatctctaga ctccataata tgcaatttat gagaaaagtt 

1381 gcaatgtatg gtggaaaaca gctctacgaa cttgatccat ctctaaattg ttaccgggtg 

1441 accaagtggt cctatcagcc agatgccgat caagaagacc ctggcgtttg gaaccttgat 

1501 ggagaaattt tggagcaacc aaaagatgag ccactgcatt tcaagcttca tccacagctt 

1561 atctcattct tcggaagaga cgcagccatg gtaaagccga caaaaagaag cttcataaaa 

1621 aagcgaaaat cttcgatcgt ttatcaatga 
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□ 1: AI47S197. tm50c06.xl NCI_CG...[gi;4371423] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name : 
GenBank Acc: 
GenBank gi: 



2293388 

tm50c06 . xl 

AI478197 

4371423 



CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type: 



IMAGE : 2161546 (3 1 ) 

NCI 

591 

CDNA 



PRIMERS 

Sequencing : 
PolyA Tail : 



-40UP from Gibco 
Unknown 



SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 



GGCAGCGCCAAGCAAATGTCGCCTTCCCCTCGCCACCCCCGGGGCCGCCGCCGCCGCCAG 
CGAGCCCCGAGCTCAGTGTCGCTCAAACCTCCCCTCCCGGCCCCGCTCGGCCCTCCCCCC 
GCGCCCCGCAGACCCTTTCAAACCCCCTGGGCTCGCGGCGGCTGCTCGGTGAGGAAACGC 
TGGCCGCGGCTGGGGCCGGGGCGACGCGGCTGCGAGCGGGGCGGCGGCGAGGCGCCCGGT 
ACGGTGAGCCCAGCAACGAGGCGCGGGCGGGCGCGCAGAGACGCCGGGTCCTCGCGGAGC 
ACGGCCGGGGCGCGGCGGCGGCGGTGGCAGCGGGAGCCCAGGGACCGCTCCATCACTGGC 
AGCCATGGAGCCCGAGCATTTTCCTCCCTCAGGGCAGGCGCCTCGACTCGGCACACGATC 
CTCGGGAGGAAGGGGCCGGCACCCGAGCCTCTCCGGAGGGGCGGCGGACGTGCGCTAGGG 
CGGAGGGTGCGACCCTCTCTTATCCTCTTCCTCCGCAGCCGGCTCTTCCTCACCCTCGTG 
CC 

High quality sequence stops at base: 459 

Mar 5 1999 
Apr 14 1999 



COMMENTS 



PUTATIVE ID 



Tissue Procurement: Christopher Moskaluk, M.D., Ph.D., 

Michael R. Emmert -Buck, M.D., Ph.D. 

cDNA Library Preparation: M. Bento Soares, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 

Center 

Clone distribution: NCI -CGAP clone distribution information 
can be found through the I.M.A.G.E. Consort ium/LLNL at: 
www -bio . llnl . gov/bbrp/image/image . html 



Assigned by submitter 

SW : VNUA_PRVKA P334 85 PROBABLE NUCLEAR ANTIGEN . 
MER22.bl TAR1 TAR1 repetitive element ; 



; contains 



LIBRARY 

Lib Name: NCI_CGAP_Kidll 
Organism: Homo sa piens 

Organ: kidney 
Lab host: DH10B 

Vector: pT7T3D -Pac {Pharmacia) with a modified polylinker 

R - Site 1: Not I 

R . Site 2: Eco RI 

Description: Plasmid DNA from the normalized library NCI_CGAP_Kid3 was 
prepared, and ss circles were made in vitro. Following HAP 
purification, this DNA was used as tracer in a subtractive 
hybridization reaction. The driver was PCR-a mplified cDNAs 
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from a pool of 5,000 clones made from the same library 

(clonelDs 1322376 -1323911, 1456007 -1456775, and 

1500552 -1502855). Subtraction by Bento Soares and M. Fatima 

Bonaldo. 



SUBMITTER 

Name : 
E-mail : 



Robert Strausberg, Ph.D. 
cqa pbs-r@mail . nih . gov 



CITATIONS 

Title: 

Authors : 
Year: 
Status : 



National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI -CGAP htt p : //www . ncbi . nlm. nih . qov/ncicq ap 
1997 

Unpublished 



MAP DATA 



Revised: July 5, 2002. 



Disclaimer | VWeJoJheJielpJDesk 
NCBI | NLM | NIH 



http://www.ncbi. nlm. nih. gov/entrez/queryTcgi?cmd=Retrieve&db=nucleotide&list_uids=, 



11/19/2002 



f 




NCBI Sequence Viewer 



Page 1 of 2 




Search [Nucleotide Jf| for 

Limits 




vffNucleptide 

PopSet Toxonomy ( 



Go r.Glear 



Preview/index 



History 



Clipboard 



Details 



default 



;Save 



Text 



;. Add to Glipboard - ;i 



□ 1: D31 133. HUML12618 Human f...[gi : 6440 13] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi : 

CLONE INFO 

Clone Id: 
DNA type : 



111933 

HUML12618 

D31133 

644013 



(5M 
CDNA 



PRIMERS 

PolyA Tai 1 : 

SEQUENCE 



Unknown 



GCTGAACCCGCGCGGCGCAAGGGCAAGGCCTTGCAGCTCTTCCGGAGTCACGTGCAGCCC 
CTTTTGGCTGAGGCTGAAATCTCCTTCACGCTGATGCTCACTGAGCGGCGGAACCACGCG 
CGGGAGCTGGTGCGGTCGGAGGAGCTGGGCCGCTGGGACGCTCTGGTGGTCATGTCTGGA 
GACGGGCTGATGCACGAGGTGGTGAACGGGCTCATGGAGCGGCCTGACTGGGAGACCGCC 
. ATCCAGAAGCCCCTGTNTAGCCTCCCAGCAG 



Entry Created: 
Last Updated: 



Feb 8 1995 
Feb 8 1995 



LIBRARY 

Lib Name: 
Organism : 



Human fetal lung 

Homo sapiens 



SUBMITTER 

Name : 
Lab: 

Institution : 
Address : 

Tel: 
Fax : 
E-mail : 

CITATIONS 

Medline UID: 
Title: 

Authors : 
Citation : 



Yusuke Nakamura 

Institute of Medical Science 

University of Tokyo 

4 -6-1, Shirokanedai , Minato -ku, 
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1: AA026479. zj99b03.rl Soares...[gi: 1492691] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc: 
GenBank gi: 
GDB Id: 



640696 

zj99b03.rl 
AA026479 
1492691 
3755135 



CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type : 

PRIMERS 

Sequencing : 
PolyA Tail : 

SEQUENCE 



Quality -. 

Entry Created: 
Last Updated: 



IMAGE:469133 (5* ) 
IMAGE Consortium, LLNL 
1208 
CDNA 



-28M13 rev2 from Amersham 
Unknown 



NCGCCATCCAGAAGCCCCTGTGTAGCCTCCCAGCAGGCTCTGGCAACGCGCTGGCAGCTT 

CCTTGAACCATTATGCTGGCTATGAGCAGGTCACCAATGAAGACCTCCTGACCAACTGCA 

CGCTATTGCTGTNCCGCCGNTGCTGTCACCCATGAACCTGCTGTCTCTGCACACGGCTTC 

GGGGCTGCGCTCTTCTCTGTGCTCAGCCTGGCCTGGGGCTTCATTGCTGATGTGGACCTA 

GAGAGTGATAAGTATCGGCGTCTGGGGGAGATGCGCTTCACTCTGGGCACCTTCCCTGCG 

TCTGGGCAGCCCCTGNGCACCTAANCGCGGCCGACNTGGCT 

High quality sequence stops at base: 165 

Jul 23 1996 
May 9 1997 



COMMENTS 



This clone is available royalty -free through LLNL ; contact 
the IMAGE Consortium { info@ imag e . llnl . gov) for further 
information. 



LIBRARY 

Lib Name: 
Organism: 
Sex : 
Organ : 

Develop, stage: 
Lab host: 
Vector: 
R. Site 1: 
R. Site 2 : 
Descript ion : 



Soares_pregnant_uterus_NbHPU 
Homo sa piens 
female 
uterus 
adult 
DH10B 
pT7T3 -Pac 
Not I 
Eco RI 

1st strand cDNA was primed with a Not I - oligo(dT) primer 
[ 5 • AACTGGAAGAATTCGCGGCCGCCTTTTTTTTTTTTTTTTTT 3'] ( 
double -stranded cDNA was ligated to Eco RI adaptors 
(Pharmacia) , digested with Not I and cloned into the Not I 
and Eco RI sites of the modified pT7T3 vector. Library went 
through one round of normalization. Library constructed by 
M. Fatima Bonaldo. 
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Name : 

Institution: 
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Tel : 
Fax : 
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Wilson RFC 

Washington University School of Medicine 

4444 Forest Park Parkway, Box 8501, St. Louis, 

314 286 1800 

314 286 1810 

est@watson . wustl . edu 
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Generation and analysis of 280,000 human expressed sequence 
tags 

Hillier, L. , Lennon,G. , Becker, M., Bonaldo, M . F . , Chiapelli,B. 
, Chissoe,S., Dietrich, N. , DuBuque , T . , Favello,A., Gish,W., 
Hawkins, M., Hultman,M., Kucaba,T., Lacy,M., Le,M., Le , N . , 
Mardis,E., Moore, B., Morris, M., Parsons, J., Prange,C, 
Rifkin,L. ( Rohlfing,T., Schellenberg, K . , Soares,M.B., Tan,F. 
, Thierry -Meg, J . , Trevaskis , E . , Underwood, K. , Wohldmann, P . , 
Waterston, R. , Wilson, R. , Marra,M. 
Genome Res. 6 (9): 807 -828 1996 
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□ 1: AA232791. zr45fD8.rl Soares...[gi: 1855784] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi: 
GDB Id: 



872129 

zr45f 08 . rl 
AA232791 
1855784 
5428407 



CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type: 



IMAGE:666375 (5 1 ) 
IMAGE Consortium, LLNL 
1252 
cDNA 



PRIMERS 

Sequencing : 
PolyA Tail : 

SEQUENCE 



Quality : 

Entry Created: 
Last Updated : 



-2 8ml3 rev2 ET from Amersham 
Unknown 



CAGGCTCTGGCAACGCGCTGGCAGCTTCCTTGAACCATTATGCTGGCTATGAGCAGGTCA 

CCAATGAAGACCTCCTGACCAACTGCACGCTATTGCTGTGCGCCGGTGCTGTCACCCATG 

AACCTGCTGTCTCTGCACACGGCTTCGGGGCTGCGCTCGTTCTCTGTGCTCAGCCTGGCC 

TGGGGCTTCGATTGCTGATGTGGACCTAGAGAGTGAGAAGTATCGGCGTCTGGGGGAGAT 

GCGCTTCACTCTGGGCACCTTCCTGCGTCTGGCAGCCCTGCGCACCTACCGCGGCCGACT 

GGCTACCTCCCTGTAGGAAGAGTGGGTTCCAAGACACCTGCCTCCCCCGTTGTGGTCCAG 

C AGGG CC CGGGTAG ATG C AC AC CTTG 

High quality sequence stops at base: 362 

Nov 27 1996 
Aug 6 1997 



COMMENTS 

This clone is available royalty -free through LLNL ; contact 
the IMAGE Consortium ( inf o @ imag e . llnl . gov? for further 
information. 

LIBRARY 

Lib Name: Soares_NhHMPu_Sl 
Organism: Homo sapiens 

Organ: mixed (see below) 

Tissue type: Pooled human melanocyte, fetal heart, and pregnant uterus 
Lab host: DH10B 

Vector: pT7T3D -Pac (Pharmacia) with a modified polylinker 

R. Site 1: Not I 

R. Site 2 : Eco RI 

Description: Equal amounts of plasmid DNA from three normalized libraries 
(melanocyte 2NbHM, pregnant uterus NbHPU , and fetal heart 
NbHHl9W) were mixed, and ss circles were made in vitro. 
Following HAP purification, this DNA was used as tracer in a 
subtractive hybridization reaction. The driver was 
PGR -amplified cDNAs from pools of 5,000 clones made from the 
same 3 libraries. The pools consisted of I.M.A.G.E. clones 
260232 -265223, 34 04 88 -34 54 79, and 4 844 88 - 4 8 94 7 9 . 
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Washington University School of Medicine 
4444 Forest Park Parkway, Box 8501, St. Louis, 
314 286 1800 
314 286 1810 
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Authors : 
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Status: 



WashU -Merck EST Project 1997 

Hillier,L., Allen, M., Bowles, L., Dubuque f T., Geisel,G., Jost 
,S., Kucaba,'T., Lacy,M., Le, N. , Lennon.G., Marra,M., Martin 
,J., Moore, B. , Schellenberg, K. , Steptoe,M., Tan,F., Theising 
,B., White, Y. , Wylie,T., Waterston, R . , Wilson, R. 
1997 

Unpublished 
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□ 1: AI769914. wj30d06.xl NCI_CG...[gi:5236423] 



Links 



IDENTIFIERS 

dbEST Id: 

EST name : 
GenBank Acc: 
GenBank gi: 

CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type : 

PRIMERS 

Sequencing : 
PolyA Tail: 

SEQUENCE 



Quality : 

Entry Created: 
Last Updated: 

COMMENTS 



PUTATIVE ID 



2724987 

wj 30d06 . xl 

AI769914 

5236423 



IMAGE:2404331 (3') 

NCI 

1211 

cDNA 



-4 0UP from Gibco 
Unknown 



TTTGGGAATGTTACTTTATTTGGATTTGGTTCGTGGGGTGGGGGTCTCAAAACAAACTAA 
AAGGCCTTACATAGGCAGCTGGGCCCAGCCAGCTGGGCTCCTGACCCAGGACTTCATTCT 
GGCCTGTCCCCCCAAAGCATAGCCTCCACCTTCTCACCCTTCTCCAGAGGAGTCTCCTCC 
ACCCCCACAGGAGCTGTGGACAGGCCCTGCAGCCCTAGGGAAGGAGGAAGGGTCCTGCAA 
GTAAACACTAAGGCACAACGCGGCCCAGGGGTCATAAGGGCTCTTCTGGCGGTGGCATCT 
G CTG G GG CTTC CAG C TGGG CGGGGG CTC C ACG CAAC CG CTG ACC ATCC AG AAG TAG TTTG 
GGTGCACCTGGCCCTGCACGGCCTCGCTAACCATCAATTCCCCATCCACTGCAAACAGAC 
CTTTCCCATCCTTGGGCTCCAAGCGGAAGGCGACCACGGGCACATATACCAAGTAGGGGC 
ATTCATACTCCATATGCCTGCCCTTCTCCATGGCCAGGAAGAGGCGCAGCAGCATGGCAC 
GAGACACTCCCGCCCGCACGTAGAACAGATGCATGACGCCAGCTGCACAGCGGCCCATGG 
G TG CAG C AAAC ATCTC AC TGG C C AGG TG CG AGTG CAG C AGTGC CAGG ACTAG C AC AAAG T 
TCTCGTCGGGCACCACCTGTCAGTGAGAAGGCACCTGCTCCTCCAGTGGCACCAGGTGTG 
CATCTACCGGGCCCTGCTGGACCACACCNGGGGAAGCANGTGTTCTTGAAAC 
High quality sequence stops at base: 467 

• v.-' 

Jun 2 8 19 9 9 - 
Dec 21 1999 



Tissue Procurement: Christopher Moskaluk, M.D., Ph.D., 

Michael R. Emmert -Buck, M.D., Ph.D. 

cDNA Library Preparation: M . Bento Soares, Ph.D. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 

Center 

Clone distribution: NCI -CGAP clone distribution information 
can be found through the I.M.A.G.E. Consort ium/LLNL at: 
www -bio. llnl . gov/bbrp/image/image . html 

Assigned by submitter 

TR: 08 8886 08 8 886 SPHINGOSINE KINASE. ; 



LIBRARY 

Lib Name: NCI_CGAP_Kidl2 

Organism: Homo sapiens 

Organ : kidney 

Tissue type: 2 pooled tumors (clear cell type) 

Lab host: DH10B 

Vector: pT7T3D -Pac (Pharmacia) with a modified polylinker 

R. Site 1: Not I 

R. Site 2: Eco RI 

Description: Plasmid DNA from the normalized library NCI_CGAP_Kid5 was 



http://www.ncbi. nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=nucleotide&list_uids=... 1 1/19/2002 



NCBI Sequence Viewer 



Page 2 of 2 



prepared, and ss circles were made in vitro. Following HAP 

purification, this DNA was used as tracer in a subtractive 

hybridization reaction. The driver was PCR-a mplified cDNAs 

from a pool of 5,000 clones made from the same library 

(clonelDs 1323912 -1325831, 1471368-1472903 and 

1492104 -1493255) . Subtraction by Bento Soares and M. Fatima 

Bonaldo. 



SUBMITTER 
Name : 
E-mail : 



Robert Strausberg, Ph.D. 
cqapbs-r@mail . nih. gov 



CITATIONS 

Title : 



National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI -CGAP http : //www. ncbi . nlm. nih . qov/ncicg ap 
1997 

Unpublished 



Authors : 
Year: 
Status : 
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□ 1: AI972156. wr63c05.xl NCI_CG...[gi:5768982] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name : 
GenBank Acc : 
GenBank gi: 

CLONE INFO 

Clone Id: 
Source : 

Insert length: 
DNA type: 

PRIMERS 

Sequencing: 
PolyA Tail: 

SEQUENCE 



Quality: 

Entry Created: 
Last Updated: 



3098238 

wr63c05 . xl 

AI972156 

5768982 



IMAGE : 24 92360 (3') 

NCI 

1774 

cDNA 



-40UP from Gibco 
Unknown 



TTTTGGGAATGTCACTTTATTTGGATTTGGTTCGTGGGGTGGGGGTCTCAGAACAAACTA 

GAAGGCCTTACATAGGCAGCTGGGCCCAGCCAGCTGGGCTCCTGACCCAGGACTTCATTC 

TGGCCTGTCCCCCCAAAGCATAGCCTCCACCTTCTCACCCTTCTCCAGAGGAGTCTCCTC 

CACCCCCACAGGAGCTGTGGACAGGCCCTGCAGCCCTAGGGAAGGAGGAAGGGTCCTGCA 

AGTAGACACTAAGGCACAGCGCGGCCCAGGGGTCATAAGGGCTCTTCTGGCGGTGGCATC 

TGCTGGGGCTTCCAGCTGGGCGGNGGCTCCACGCAACCGCTGACCATCCAGAAGTAGTTT 

GGGTGCACCTGGCCCTGCACGGCCTCGCTAACCATCAATTNCCCATCCACTGCAAACACA 

CCTTTCCCATCCTTGGGCTCCAAGCGGAAGGCGACCACGGGCACATATACCAAGTAGGNG 

CATTCATACTCCATATGCCTGCCCTTCTNCATGGCCAGNAAGAGGCGCAGCAGCATGGCA 

CGAGACACTCCCCGCCCGACGTAGAACAGATGCATGACGCCAGCTGCACAGCGGCCCATG 

GG TGC AG C AAAC ATCTC ACTGG C C AGG TG CG AG TG C AG 

High quality sequence stops at base: 333 

Aug 25 1999 
Mar 8 2000 



COMMENTS 



PUTATIVE ID 



Tissue Procurement: Christopher Moskaluk , . M . D . , Ph.D., 

Michael R. Emmert -Buck, M.D., Ph.D. 

cDNA Library Preparation: Life Technologies, Inc. 

cDNA Library Arrayed by: Greg Lennon, Ph.D. 

DNA Sequencing by: Washington University Genome Sequencing 

Center 

Clone distribution: NCI -CGAP clone distribution information 
can be found through the I.M.A.G.E. Consort ium/LLNL at: 
www -bio . llnl .gov/bbrp/image/image . html 

Assigned by submitter 

TR:088886 088886 SPHINGOSINE KINASE. ; 



LIBRARY 

Lib Name: NCI_CGAP_Utl 
0 rg a n i s m : Homo s a piens 

Organ: uterus 

Tissue type: well -differentiated endometrial adenocarcinoma, 7 pooled 

tumors 
Lab host: DH10B 
Vector: pCMV -SPORT6 

R. Site 1: Sail 
R. Site 2 : NotI 

Description: Cloned unidirect ionally . Primer: Oligo dT. Average insert 
size 1.75 kb. Life Technologies catalog #: 11538-0 14 
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E-mail : 

CITATIONS 

Title: 

Authors : 
Year: 
Status : 



Robert Strausberg, Ph.D. 
rqa pbs-r@mail .nih. g ov 

National Cancer Institute, Cancer Genome Anatomy Project 
(CGAP) , Tumor Gene Index 

NCI -CGAP htt p: //www.ncbi .nlm.nih. qov/ncicg ap 
1997 

Unpublished 
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□ 1: AA081152. zn34a06.rl Strata.. .[gi: 1623227] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc: 
GenBank gi: 
GDB Id: 



716400 

zn34a06 . rl 
AA081152 
1623227 
3927815 



CLONE INPO 

Clone Id: 
Source : 

Insert length: 
DNA type: 



IMAGE:549298 (5' ) 
IMAGE Consortium, LLNL 
761 
cDNA 



PRIMERS 

Sequencing : 
PolyA Tail : 



-28M13 rev2 from Amersham 
Unknown 



SEQUENCE 



CACNGAGGACTTTGTGCTAGTCCTGGCACTGCTGCACTCGCACCTGGGCAGTGAGATGTT 
TGCTGCACCCATGGGCCGCTGTGCAGCTGGCGTCATGCATCTGTTCTACGTGCGGGCGGG 
AG TGTC T CG TG CC ATG C TG CTG CG C CTCTTC CTG G C C ATGG AG AAGGG C AGG C ATATGG A 
GTATGAATGCCCCTACTTGGTATATGTGCCCGTGGTCGCCTTCCGCTTGGAAGCCCAAGG 
AATGGG AAAAGG TG TG TTTG CAAG TGG ATGGGG AATTG ATGG TT AGCG AGG C CGTG CAAG 
GGCAAGGTTGCACCCAAACTACTTCTTGGATGGTCAGCGGTTGCGTTGGAACCCCC 



Entry Created: 
Last Updated: 



Jul 18 1996 
Dec 1 1996 



COMMENTS 



This clone is available royalty -free through LLNL ; contact 
the IMAGE Consortium { info@ image . llnl . gov) for further 
information. 



LIBRARY 

Lib Name: 
Organism : 
Develop, stage: 
Lab host : 
Vector : 
R. Site 1: 
R. Site 2: 
Description : 



Stratagene endothelial cell 937223 
H omo s apiens 

umbilical vein, 1 passage 

SOLR (kanamycin resistant) 

pBluescript SK - 

EcoRI 

Xhol 

Cloned unidirectionally . Primer: Oligo dT. Umbilical vein 
endothelial cells, passaged once. Average insert size: 1.0 
kb; Uni -ZAP XR Vector; -5' adaptor sequence: 5' 
GAATTCGGCACGAG 3' -3 ' adaptor sequence: 5' 
CTCGAGTTTTTTTTTTTTTTTTTT 3 * 



SUBMITTER 

Name: Wilson RK 

Institution: Washington University School of Medicine 

Address: 4444 Forest Park Parkway, Box 8501, St. Louis, 

Tel: 314 286 1800 

Fax: 314 286 1810 

E-mail: est@watson . wustl . edu 



MO 63108 



CITATIONS 

Medline UID: 
Title : 



97044478 

Generation and analysis of 280 
tags 



000 human expressed sequence 
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Citation: 



HillierL., Lennon,G., Becker, M. , Bonaldo, M. F . , Chiapelli,B. 

Chissoe,S., Dietrich, N . , DuBuque , T . , Favello,A. , Gish W. , 
Hawkins, M., Hultman.M.. Kucaba,T., Lacy,M., Le,M., Le.N., 
Mardis,E., Moore, B. , Morris, M., Parsons, J., Prange C 
Rifkin,L., Rohlfing.T., Schellenberg , K . , Scares, MB Tan^F. 
, Thierry -Meg, J . , Trevaskis , E . , Underwood, K. , Wohldmann, P . , 
Waterston,R., Wilson, R . , Marra,M. 
Genome Res. 6 (9) : 807 -828 1996 
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□ 1: AL096766. Human DNA sequenc...[gi:5738627] 



Links 



LOCUS HSDA59H18 70890 bp DNA linear PRI 07-J AN-2000 

DEFINITION Human DNA sequence from clone 59H18 on chromosome 22. Contains the 

3' part of the gene for KIAA0767, a novel gene, ESTs, STSs, GSSs 

and a putative CpG island, complete sequence. 
ACCESSION AL096766 

VERSION AL096766.12 GI : 5738627 

KEYWORDS HTG; CpG Island; KIAA0767. 
SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 70890) 
AUTHORS Bates, K. 
TITLE Direct Submission 

JOURNAL Submitted (05 -OCT- 1999) Sanger Centre, Hinxton, Cambridgeshire, 
CB10 ISA, UK. E-mail enquiries: humquery@sanger.ac.uk Clone 
requests : clonerequest@sanger .ac.uk 
COMMENT On Aug 18, 1999 this sequence version replaced gi: 5734408 . 

This sequence has been finished according to sequence map criteria 
as follows. An attempt is made to resolve all sequencing problems, 
such as compressions and repeats, but not necessarily within known 
annotated human repeat sequence elements (e.g. Alu) . Where the 
sequence is ambiguous, there is an annotation using the 'unsure' 
feature key. 

This sequence was generated from part of bacterial clone contigs of 
human chromosome 22, constructed by the Sanger Centre Chromosome 22 
Mapping Group. Further information can be found at 

h t t p: //www. sanqer . ac . uk/HGP/Chr22 

During sequence assembly data is compared from overlapping clones, 
where differences are found these are annotated as variations 
together with a note of the overlapping clone name. Note that the 
variation annotation may not be found in the sequence submission 
corresponding to the overlapping clone, as we submit sequences with 
only a small overlap as described above. 

The following abbreviations are used to associate primary accession 
numbers given in the feature table with their source databases: 
Em:, EMBL; Sw:, SWISSPROT; Tr : , TREMBL ; Wp: , WORMPEP; Information 
on the WORMPEP database can be found at 

http: / /www. sanqer . ac . uk/Pro j ects /C eleqans/worm pep IMPORTANT: This 
sequence is not the entire insert of clone 59H18. It may be shorter 
because we only sequence overlapping sections once, or longer 
because we arrange for a small overlap between neighbouring 
submissions . 

The true right end of clone 439F8 (AL021392) is at 100 in this 
sequence. 59H18 is from the library RPCI-6 constructed at the 
Roswell Park Cancer Institute by the group of Pieter de Jong. For 
further details see htt p: //bacpac . med . buf f alo . edu/ VECTOR: pPAC4 . 
FEATURES Locat ion/Qual i f iers 

source 1 . . 70890 

/organism="Homo sapiens" 
/db_xref ="taxon: 9606" 
/chromosome = "22" 
/clone="59H18" 
/clone_lib="RPCI -6" 
1. .877 

/note-"Tandem repeat. The assembly in this region is 
consistent with the digest." 
1 . . 264 

/note="66 copies 4 mer gatg 74% conserved" 
r ?P.?^l_^9iPi} 10.. 269 



misc feature 



repeat reg ion 
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repeat region 
re peat_ reg ion 
rj§p_ejat_regj^gn 
repeat region 
repeat ^region 
repeat reg ion 
repeat region 
repeat region 
r^£eajt_re_g^on 
misc_featur_e 
misc_feature 
misc feature 
rep_e_at _region 
re^eat_regAon 

reDe_at_regi_P. n _ 

r^P^iLt_^gAp_n 

misc_fea.tu.re 

^ep_e_a_ t_re_g ion 

re p_ea t _ r e g i on 

repejjt_re_g_i_on 

gene 

mRNA 



CDS 



/note="5 copies 52 mer 76% conserved" 
13 . .268 

/note- "16 copies 16 mer 73% conserved" 
13. .264 

/note="9 copies 28 mer 73% conserved" 
15. .254 

/note= n 6 copies 40 mer 75% conserved" 
426 . . 545 

/note="3 copies 40 mer 75% conserved" 
555 . - 866 

/note="78 copies 4 mer gatg 95% conserved" 
555 . . 862 

/note="H copies 28 mer 96% conserved" 
555. . 858 

/note="19 copies 16 mer 96% conserved" 
568. .879 

/note="6 copies 52 mer 93% conserved" 
568 . . 847 

/note="7 copies 4 0 mer 97% conserved" 
1551 . . 1897 

/note="match: GSS : Em:B13877" 
1677. .2176 

/note="match: GSS : Em: AQ50683 7" 
1684 . .2116 

/note="match: GSS: Em:AQ708634" 
2177 . . 2326 

/note="6 copies 25 mer 91% conserved" 
3927 . .4383 

/note= " MLT1D repeat: matches 1..505 of consensus" 
5121. .5224 

/note="2 copies 52 mer 94% conserved" 
6832.. 7316 

/note="L2 repeat: matches 2068.. 2542 of consensus 
8624 . . 9082 

/note="match: GSS: Em:AQ372111" 
8762 . . 8842 

/note="3 copies 27 mer 99% conserved" 
9337 . . 9385 

/note= "MLT1D repeat: matches 261.. 312 of consensus 
9391.. 9572 

/note="L2 repeat: matches 2348.. 2535 of consensus 

10055 . .31655 

/gene="dA59H18.1" 

-join (10055. .10175,13249. .13310,14908. .15040,15706. .15731, 
15894 15985, 17483. .17574,18693. .18741,19964. .20035, 
20557 20600,20667.-20776,24711. .24865,25538.-25682, 
26525. .26618,27336. .27420,28468.-28536,2 9015. .31655) 
/gene="dA59H18 . 1" 

/product="dA59H18.1 (KIAA0767 protein)" 
/note="match: cDNAs : Em:AB018310 

match- ESTs: Em:AI092570 Em:AA103568 Em:H11873 Em:H55444 
Em-AI279741 Em:H45850 Em:AI268624 Em:AI128752 Em:AA830080 
Em-AI689749 Em:AI310136 Em:AI651233 Em:AI376926 Em:T89772 
Em-AI201798 Em:AI925374 Em:AI916869 Em:R12734 Em:AI373013 
Em:AI201646 Em:AA551691 Em:AI342527 Em:AI073809 
Em-AI857492 Em:AI399959 Em:AI884898 Em:AA053921 
Em-AI625017 Em:AA034430 Em:AA769084 Em:AI435899 Em:D19781 
Em-AA287765 Em:AI363150 Em:Z41746 Em:AI580064 Em:R18977 
Em:R39786 Em:AA417669 Em:AI274949 Em:AI751948 Em:R44862'< 
/evidence=not_experimental 

join(<10055. .10175,13249. .13310,14908. .15040, 
15706. .15731, 15894. .15 985,17483. .17574,18693. 
19964 .20035,20557. .20600,20667. .20776,24 711 
25538. .25682,26525. .26618,27336. .27420,28468 

29015. .29119) 

/gene="dA59H18.1" 

/codon_start=3 

/evidence=not_experimental 

/product="dA59H18.1 (KIAA0767 protein)" 

/protein_id=" CAB62976.1 " 

/ 1 rans la t ion= " EELRKLREETNAEMLRQELDRERQRRMELEQKVQEVLKARTEEQ 
MAQQPPKGQAQASNGAERRSQGLSSRLQKWFYERFGEYVEDFRFQPEENTVETEEPLS 
ARRLTENMRRLKRGAKPVTNFVKNLSALSDWYSVYTSAIAFTVYMNAVWHGWAIPLFL 
FLAILRLSLNYLIARGWRIQWSIVPEVSEPVEPPKEDLTVSEKFQLVLDVAQKAQNLF 



. 18741, 
.24865, 
, .28536, 
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re peat _req ion 
repeat_req ion 
re peat _req ion 
repeat region 
repeat^reqion 
re peat _region 
repeatreqion 
repeat _req i on 
misc_f eature 

*B isc_ f eatu re 

r e peat _ re g ion 
re peat _region 
re peat _reg ion 
r e pea t _ r e g _i on 
peatreg ion 
r e p e a t _ region 
? e P e a t r e g i on 
repeat_region 
repeat regi on 
repeat_ region 
re peatre g ion 
repeatregion 
repea t_ region 

misc_f eat ure 

misc feature 

polyA sig nal 

polyA_site 

polyA site 

polyA site 
repeat_reqion 

repeat_region 

re peat _ region 

re peat_req ion 

repeat_req ion 



G KMADI L E KI KNL FMWVQP E I TQKL YVALWAAFLASC F F P YRL VGLA VGL YAG I KFFL 
IDFIFKRCPRLRAKYDTPYIIWRSLPTDPQLKERSSAAVSRRLQTTSSRSYVPSAPAG 
LGKEEDAGRFHSTKKGNFHEIFNLTENERPLAVCENGWRCCLINRDRKMPTDYIRNGV 
LYVTENYLCFESSKSGSSKRNKVIKLVDITDIQKYKVLSVLPGSGMGIAVSTPSTQKP 
LVFGAMVHRDEAFETILSQYIKITSAAASGGDS " 
10098. '.10169 

/note="12 copies 6 mer gctgga 72% conserved" 
11004 . . 11281 

/note=" 139 copies 2 mer cc 55% conserved" 
11004 . . 11227 

/note="8 copies 28 mer 62% conserved" 
11011. . 11130 

/note="3 copies 40 mer 92% conserved" 
11041 . . 11142 

/note="3 copies 34 mer 77% conserved" 
13889 . . 13976 

/note="MER58 repeat: matches 126.. 216 of consensus" 
15365. . 15464 

/note="L2 repeat: matches 2609.. 2710 of consensus" 
16115. . 16191 

/note="MIR repeat: matches 119.. 194 of consensus" 
16998. .17428 
/gene="dA59H18 . 1" 
/note="match: GSS : Em:AQ589694" 
17084 . . 17413 
/gene="dA59H18. 1" 
/note="match: STS : Em:B18088" 
19139. . 19235 

/note="AluSg/x repeat: matches 2 14.. .3 10 of consensus" 
19237.. 19401 

/ note= "MER5 8 A repeat: matches 29.. 206 of consensus" 
20051. .20118 

/note="2 copies 34 mer 93% conserved" 
21370. .21433 

/note="LlM3c repeat: matches 1243.. 1304 of consensus" 
21434 . .21746 

/note="AluYb8 repeat: matches 1..316 of consensus" 
21747 . .21813 

/note="LlM3c repeat: matches 1169.. 1243 of consensus" 
22481 . .22770 

/note="AluSg repeat: matches 1..289 of consensus" 
23131 . .23309 

/note="AluSq/x repeat: matches 3.. 249 of consensus" 
23358. .23553 

/note="AluY repeat: matches 116.. 311 of consensus" 
23555 . . 23712 

/note="AluSg/x repeat: matches 120.. 291 of consensus" 
26967 . .27074 

/note="4 copies 27 mer 94% conserved" 
27047. .27138 

/note="4 copies 23 mer 95% conserved" 
27116. . 27196 

/note="3 copies 27 mer 94% conserved" 
29148 . .29183 

/note="18 copies 2 mer tt 89% conserved" 
complement (312 56 . .31653) 
/note= "match: STS: Em:R44862" 
complement (31347 . .31546) 
/note="match: STS: Em:G43099" 
31628. .31633 
/gene="dA59H18.1" 
31653 

/gene="dA59H18 .1" 
31655 

/gene="dA59H18 .1" 
31659 

32409. .32828 

/note="105 copies 4 mer tcca 85% conserved" 
32412 . . 32827 

/note="26 copies 16 mer 85% conserved" 
32433 . .32832 

/note="10 copies 40 mer 86% conserved" 
32433. .32796 

/note="7 copies 52 mer 87% conserved" 
32442. .32833 
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repeat region 
repeat region 
repeat reg ion 
repeat region 
misc feature 
repeat reg ion 
r^eat^region 
re peat Region 
r^£eat_reg_ion 

g ene 
mRNA 



.1957 of consensus" 



1 . . 133 of consensus" 
1 . .302 of consensus" 



CDS 



.36297) 



/note="14 copies 28 mer 84% conserved" 
32462.. 32817 

/note="178 copies 2 mer ac 57% conserved 
32582.-32827 

/note="41 copies 6 mer ccaccc 58% conserved 
33197 : . 33416 

/note="Charlie4 repeat: matches 1718. 
33446 . .33571 

/note=«L2 repeat: matches 2090.. 2216 of consensus 
33572. .34117 

/note="match: EST: Em:AA059372" 

34697 . .34942 „ 
/note="AluSg repeat: matches 51.-295 of consensus 

34988. .35119 

/note="AluJb repeat: matches 
35140 . .35445 

/note="AluJb repeat: matches 
35622. .35939 

/note="AluY repeat: matches 1..303 of consensus 
complement (join (36277 . . 36716,38462.. 38586)) 
/qene="dA59H18.3" „ 0 „ 0 ^ v 

complement (join (36277 . . 36716, 38462.. >38586) ) 
/gene="dA59H18.3" 

/product="dA59H18.3 (novel protein)" 
/^ 0 -"nrp™d 3' end of gene dA59H18.2 

^ " IsTs Em-AI694771 Em:AA522448 Em:AI131493 Em:C01272 
Em ^36390 *Em AI803415 Em:AI264272 Em:R91886 Em:AA808169 
Fm-AA677192 Em-W67835 Em:W67828 Em:Al075655 Em:AA384221 
E E : : ^ Em AX299775 Em:AI872516 Em:AA682589 EJJ.AA««575 

Em :AI469319 Em:N32016 Em:AI679433 Em:C14 Em ^653 05 
Em:AI886059 Em:N41821 Em:AI719966 Em:AI679943 Em.H50213 
/evidence=not_experimental 
complement (36277) 
/gene="dA59H18.3" 
complement (36278) 
/gene="dA59H18.3" 
complement (362 92 . 
/gene="dA59H18.3" 
36932 . . 37238 
/note="AluSx repeat: 
39097 . . 39220 

/note="2 copies 62 mer 85% conserved" 
39783 . .40083 

" /note="AluSq repeat: matches 1..310 of consensus 
41227 . .41502 

/note="AluSx repeat: matches 40. 
42272.. 42335 

/note="4 copies 16 mer 92% conserved" 

42785 . .42928 

/note="9 copies 16 mer 72% conserved" 

42786. .42921 

/note="4 copies 34 mer 74% conserved" 
42799 . .42958 

/note="40 copies 4 mer atat 69% conserved 
42799 . .42954 

/note="3 copies 52 mer 76% conserved" 
42799.. 42918 

/note="3 copies 40 mer 81% conserved" 

42803 . .42958 

/note="26 copies 6 mer atatat 69% conserved 
42804.. 42957 

/note="77 copies 2 mer ta 70% conserved 

42804 . .42943 

/note="5 copies 28 mer 74% conserved" 
42958 . .43240 

/note-"AluSx repeat: matches 1..285 of consensus 
SlemenUjoin P (43440.. ^645 f 452 95 45371 47078^ 4 7183 , 
51181.. 51333 ,53504.. 53578, 59711.. 59856, 62954.. 63017, 

64003 . .64154) ) 
/gene="dA59H18 .2" 

complement ( join (<43440 . .43645, 45295.. 45371 
47078. .47183,51181. .51333,53504. .53578,59711. 

6 2 954 . .63017, 64 003 . . >64154) ) 
/qene="dA59H18 .2" 

/note="mostly supported by FGENES and GENSCAN 



matches 1..295 of consensus" 



.308 of consensus" 



.59856, 
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repeat reg ion 
re peat region 
repeat_region 
re peat region 
repeatregion 
repeatregion 
misc feature 



match: ESTs: Em:AA355581 Em:H55443 Em:H55199 

match: proteins: Wp:CE18241 Tr:088886 Tr:065419 Tr:088885 

Sw:P19794 Tr:Q12246 Tr:Q06147 Tr : Q9YHI7 Tr:Q63863 

Tr:014159 Sw:046641" 

/codon_start=2 

/evidence=not_experimental 

/product="dA59H18 . 2 (novel protein similar to human, 
mouse, yeast, worm and plant (predicted) proteins)" 
/protein_id=» CAB62977. 1 " 
/db_xref="GI: 6572330" 

/translations" PKHLLVFINPFGGKGQGKRIYERKVAPLFTLASITTDIIGNKFY 
VNYVEVITEHANQAKETLYEINIDKYDGIVCVGGDGMFSEVLHGLIGRTQRSAGVDQN 
HPRAVLVPSSLRIGIIPAGSTDCVCYSTVGTSDAETSALHIWGDSLAMDVSSVHHNS 
TLLRYSVSLLGYGFYGDIIKDSEKKRWLGLARYDFSGLKTFLSHHCYEGTVSFLPAQH 
TVGSPRDRKPCRAGCFVCRQSKQQLEEEQKKALYGLEAAEDVEEWQWCGKFLAINAT 
NMSCACRRSPRGLSPAAHLGDGSSDLILIRKCSRFNFLRFLIRHTNQQDQ" 
43909. .44002 

/note="MER58 repeat: matches 1..95 of consensus" 
44013 . .44320 

/note="AluSq repeat: matches 1..309 of consensus" 
44321. .44490 

/note="AluSg/x repeat: matches 129.. 296 of consensus" 
44491 . .44801 

/note="AluSq repeat: matches 1..312 of consensus" 
44802 . .44817 

/note="AluSg/x repeat: matches 296.. 311 of consensus"" 
44819. .44935 

/note="MER58 repeat: matches 120.. 238 of consensus" 
48989. .49376 

/note="match: GSS: Em:B98715" 



repeat 


_region 


49159. 


.49367 








Of 








/note= 


"AluJb repeat: 


: matches 


86. 


. .312 


consensus" 


repeat_ 


region 


49377. 


.49672 
















/note= 


"AluSx repeat: 


: matches 


1. - 


.296 


of 


consensus" 


repeat 


region 


49838 . 


.49998 
















/note= 


"MER5A repeat 


: matches 


18 


. . 189 


' of 


consensus" 


repeat_ 


reg_ion 


50030. 


.50110 
















/note= 


"MIR repeat: matches 106. 


. 188 


of 


consensus" 


repeat 


region 


50169. 


. 50288 
















/note= 


"L2 repeat: matches 2584. 


.2746 


; of 


consensus" 


teoeaz 


_region 


50477. 


.50786 
















/note= 


"AluSg repeat 


: matches 


1 . 


.293 


of 


consensus" 


repeat^ 


__regiqn 


51675. 


, .51973 
















/note= 


"AluSx repeat 


: matches 


1. 


.295 


of 


consensus" 


rep_eat_ 


_region 


51976. 


. .52290 








of 








/note= 


"AluSq repeat 


: matches 


1 . 


.312 


consensus" 


repeat 


region 


52312 . 


. .52591 
















/note= 


"AluJb repeat 


: matches 


1 . 


.293 


of 


consensus" 


repeat 


region 


52598. 


. .52893 
















/note= 


"AluSq repeat 


: matches 


1 . 


.298 


of 


consensus" 


misc feature 


53160. 


. .53572 
















/note= 


"CpG island" 













r 3. P t _ r_e g ion 
repeatregion 
repeat_region 
repeatregi on 
zep.£& tregion 
™4 sc_ f eature 

? e at _ r . ® .9AP_0 
re peat _reg ion 
repeat region 
repeat_reg ion 
repeater egion 



/evidence=not_experimental 
53769. .54083 

/note="AluSx repeat: matches 1..311 of consensus" 
54084. .54135 

/note="26 copies 2 mer tg 92% conserved" 
54092 . . 54139 

/note="3 copies 16 mer 92% conserved" 
54092 . .54133 

/note-" 7 copies 6 mer tgtgtg 100% conserved" 
54164 . . 54456 

/note="AluSg repeat: matches 1..297 of consensus" 

complement (55838 . .56240) 
/gene="dA59H18.2" 
/note="match: GSS: Em:AQ312951" 

56413. .56712 

/note="AluSg repeat: matches 1..300 of consensus" 
57258. .57545 

/note="AluSx repeat: matches 1..289 of consensus" 
57619 . . 57919 

/note="AluSx repeat: matches 1..296 of consensus" 
58014 . . 58104 

/note="L2 repeat: matches 2649.-2748 of consensus" 
58891. . 59031 
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re peat _ region 
rep_eat _region 
repeat _ reqion 
repeat region 
repeat_region 
re peat reg ion 

r_e peat _ region 
repea t_reg ion 
repea t region 
re peat reg ion 
repeat reg ion 
repeat__re_gipn 
re peat reg_j_on 
^§2eat_re_gion 
re P_eat _reg ion 
repeat _re_g Ion 
rep_eat_ region 
r^p^t_re_gipn 
repea_t_re_g_ion 
re p_e^_re5_ip_n 
re p_e a t_r e g io n 
r e pea t _ region 
re peat reg ion 
re peat reg ion 



/note="MIR repeat: matches 31.. 188 of consensus" 
59908.. 60402 

/note= "MER68B repeat: matches 9.. 550 of consensus 
61385.. 61491 

/note="L2 repeat: matches 2593.. 2708 of consensus" 
61497'.. 62012 

/note=»LlME repeat: matches 2750.. 5384 of consensus 
62015.. 62319 

/note="AluSq repeat: matches 1..307 of consensus 
62320.. 62525 

/note="LlME repeat: matches 5548.. 5773 of consensus' 
63417.. 63452 

/note-"LlPA5 repeat: matches 5969.. 6004 of consensus 
63456.. 63615 

/note="Ll repeat: matches 5211.. 5392 of consensus 
63673.-63967 

/note="AluSg repeat: matches 1..293 of consensus 
64701. .64924 

/note="AluY repeat: matches 88.. 311 of consensus" 
65038.. 65090 

/note= "L1ME3A repeat: matches 5642.. 5696 of consensus 
65091. .65405 

/note="AluY repeat: matches 1..310 of consensus" 
65406.. 65601 

/note= "L1ME3A repeat: matches 5696.-5879 of consensus 
65718 . .65855 

/note="AluJb repeat: matches 
65856 . .66162 

/note="AluSx repeat: matches 
66163 . .66303 

/note="AluJb repeat: matches 132.. 274 of consensus 
66304.. 67349 

/note="LlM4 repeat: matches 3307.. 4378 of consensus 
67394.-67586 

/note="AluJo repeat: matches 124.. 312 of consensus 
67587 . .67771 

/note="LlME3A repeat: matches 4949.. 5133 of consensus 
67772.. 68057 

/note="AluJo repeat: matches 1..282 of consensus 
68060 . .68358 

/note="AluSx repeat: matches 1..299 of consensus" 
68359.-68524 

/ not e=" LIME 3 A repeat: matches 5133.. 5298 of consensus 
68525 . .68838 

/note="AluSx repeat: matches 1..312 of consensus" 
68839.-69647 

/note= " L1ME3A repeat: matches 5297.. 6119 of consensus 
69687 . .69972 

/note="AluSx repeat: matches 21.. 311 of consensus 
70660 . .70890 

/note="AluJb repeat: matches 1..237 of consensus" 
19054 c 20135 g 15480 t 



1..132 of consensus" 
1..306 of consensus" 



BASE COUNT 16221 a 

0RIGIN 1 gatggatgca ttgctggaaa gatggatggg tggatggata aatggatgga tgagtggaca 
61 tgtagatgga tggatgggtg ggtggatgga tggatggatc agtgagtagg ^acgaata 
121 ggtagattag gtaaatggat gggtaaatgt tgggtggatg gatgggttga tgaacagaca 
8 aTtgagtggl tggatglltg gatggatgga tggatggatg gatggatgaa tgggtgagtg 
241 gatggataga tgggtgggtg aatggcagag ggatgggtag atggatgggt ^atgggtgg 
301 atggatggat ggatggatgg atgagtggat ggatggatgg atgggtgagt SS^ggatgg 
361 atlaat^at ^atgggtga gtagatagac agatgggtgg gtgaatggga 9*agggatgg 
421 ttagatgggt gggtgagtgg atgtgtggat ggatggacat gtgggtggat agatggatgg 
III atgatt^at a^atlgatgc atcaatgaga aggcaaacga atgggtagat "ggtaatgg 
541 gtgggtaaat gattgatgga tggatgcatg gatggatgga tggatggatg 9atgcatgga 
601 tglltggatg ^atgcatgga tggatggatg gatgcatgga tggatggatg gatggatgga 
661 tggatggatg catggatgga tggatggatg gatggatgga tggatggatg ^gcatgga 
721 tggatggatg catggatgga tggatggatg gatggatgga tggatgcatg gatggatgga 
Satllatl gat^atgga tgcatggatg gatggatgga tggatggatg gatggatgga 
841 tllatglatg gataggtaga caaatgaacc aataggctgt ttcctccacc aggcatactc 
901 ttttgcatat actttaaggc cttactggcc accttccaga gctccagatg «ccaccttg 
961 ctcccattct gtctggtgtt tggtcccaga tgcagtttta tctgtgccac ctttgcttcc 
1021 tagtgctttg gggcctagag tcaccagctg gaactggagt tggggagtgt ccctgattgc 
1081 ccctagccca cactcagcag ggagcaggtg cttggcagag agcctgttgc tgtgggaagg 
1141 gactgggggc ctggagggct tggctgctgg gtgccaagcg cttggttgga aagggaggtt 
1201 Itgggggcga gggatggcac gtgctgtcga gcacacgact ctcctggctc cgtgctgggc 
1261 cSlcccca gagtcclgcc cacaggctgt cagatgccgc gtgcctctca cagggagagg 
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1321 caggcgtcgg tttctgaaaa ccagctgctc 
1381 cttcttattt gacgtggtaa aaatgtcagt 
1441 ggtgtcctga aatgtaccta aaacggtggt 
1501 attaggatgt gacgcacttg gaagcttgct 
1561 acagttgatc gttcagtaaa cattatctct 
1621 taaatgttag ggagaaaatg caggcaaaac 
1681 ttctctgttt tccctgagca gggagggcca 
1741 gtgctggtgg ctgtagatga agccgtactc 
1801 gattcgatgg agggagtgcc tgtcttctag 
1861 aattgggctg ggagagttca ggtgagagtc 
1921 tgtgcgggcc tccggggatg cgggactccg 
1981 ctgagttcgc gcccctggcc tggtgggaag 
2041 ttattttcgg gtgaggctgg gcctgttgaa 
2101 agagatctac ttttacaggt ttaaaaatag 
2161 cttccctgtg cgctgagtgt cgaggctggg 
2221 ggctgcgtgt cgaggctggg tggagggctg 
22 81 tgaggctggg tggagggctg agcattgagg 
2341 gtctgggcct gtgtaggcct gagccgatgc 
2401 gccgttcact catcagagaa cgttctgggg 
2461 tccctgccct tctcctcgca gatgctgggg 
2521 aaccagcctg gctgtcccag ggcttccttt 
2581 tgtgtcgcca cgccaggctc ttgtcctgtc 
2641 cttcattcct tggagagtgg tggctggtgc 
2701 accagagtca ctcttccttg tgagcgtccc 
2761 tgcaagacgc ctgtgggtct gagggtccct 
2821 ttccatcttc ccctctcctc gacatatttt 
2881 gtgggggcag gccctggccc atcgggtggc 
2941 gggccatctc gctttccttc ctgacaatga 
3001 ggagcagacc cctggatgag aactcacgag 
3061 ccttgggact taggttctag gttccgactc 
3121 cctcccactg ggttccttgc catttggtgg 
3181 cctatttcgg ccggccctcg accctggaag 
3241 catgggatct gcacaagctg tggcctctga 
3301 tgactttctg gtctgatgtt gcgggccacg 
3361 tggcccccgg caggtgtgat atcacacgca 
3421 tgtgcggcgt gtatggctgc cacctggact 
34 81 cccaagacag gtgcaggggg tcttgggaga 
3541 ctgtggcagg tcatgcattt gtgggccacg 
3601 tgctgttcac tgcgggggtt tagagaggct 
3661 ggtggcggca ctggggaatc ctgcttcagg 
3721 ggagagagac ggagctaggg gctgccaggg 
3781 actaaaggag ggcttccagt ggttttatga 
3841 cgccgggcca ggtggaagga agaggactgt 
3901 cttgacacga ttgtctcgag ggagagtgtg 
3961 catgtccccc cagcctgtga atgtggcctc 
4021 tgagtcgccg acctttggtg gagatggttc 
4 081 aaggttctga gaaggagatg gcaggaaggt 
4141 ggtcacagtg acgtcattgc tccctggggg 
4201 aagctggaaa ggcaggaaat gggtctcccc 
4261 gcccacgcca tgagctcagc ccagggagac 
4321 agagaatacg tttgtgttgt tttaagccac 
4 381 gcagaggtca ccaaggggct ccgtggcctg 
4441 cctgcaggac accttggccc tgacctgcag 
4501 gcgtcctcag tgtctgtgag cagctcgggg 
4 561 aacccagagc ctcagcagca gctgcagatt 
4 621 cgagaggaat ctgccgtgtg aatagcagct 
4681 ctcctgtttc ttttcttact cgggaaagag 
4741 ctcctttgtc ctgtggaaag atacagctgt 
4 801 tcgctggtgg gcaggaggca ggccggtgcg 
4 861 cagtttcggg agagacagca gcttggcctt 
4 921 gactcatggg agagcagcgg tcagacgctc 
4 981 taccatcagg aacgacactg ggcaaaggtg 
5041 acaggctgat ttgccacgct gtccgaggag 
5101 gcccgctgcc ccgggagcca ggcatccttt 
5161 tggcgttcac atggcgtcct ctgggctctg 
5221 acatttgtca gggctctgca gagagcctga 
5281 ggctgcctga gacccctttg acctctgaac 
534 1 gcaggcatga catggcatcc caggctccag 
54 01 tgtccctggg ctgggactgg ccaccttggc 
5461 tggaacctgg ctggctgctg ccctttcaca 
5521 gcagccccat ttaggaaaat gtggtttcct 
5581 tctctgacct atggatggag aggcatgtgg 
5641 gaggaggtga tgtgtgggta ctaggagtta 
5701 tggcctactc cgcccaagag gcctgccggg 
5761 ctctgtgttc ccacgtgata gagtcactgc 



tgggtgaaaa ccctggctgg gatcctagtt 
acattatctg tccaccttag atcctgcatt 
tggccagcta tgtgctcagc agccttcaac 
ccttcctggt tatcagccaa gacagtgttg 
gcacatagcg gtaaatgaca ccgcaataaa 
ttcacaataa aagcgaggca tttcttggaa 
cgggaggaag gggttaaaag ggctgcctct 
tgggctctgc agtgtaactt gcagcctggt 
aaagttcctg tggtgatggc aggagggctg 
cgggggaggt ggccaggtgg ggcctctggg 
ccaggccagg agcagctttt tgggtggatt 
gagttttatt taaagattat cacttgattt 
tcagtcgtgt gccaagagga ctcctgcctg 
ccgcagaact gctttccttc ctcccagtaa 
tggagggctg cgtgtcgagg ctgggtggag 
ggcgtcgagg ctgggaggag ggctgagcat 
ctgggtggac ggctgggcct cagccccaca 
atctcctttg cctcccctca aatcagcgtg 
gttgccccgg agtgaccagt gaggtggttg 
aggcgtgaag gctggagcag gtgggtgcac 
agcctctgtg ggggcgattt ctatgtgcca 
ccgggggctc ctagtctgtg gctgattctg 
tgactggaag tgggtggggc tgcctgcagg 
taaaccttgc gctagagctt gacccctgac 
ctgggcacgc agagtgtccc ctgggcttcc 
tctgcccagt ggctcttgtg ataggttgga 
catcctggtt tccctgtgtg ccacgtaccc 
gacggctgat gtgccttcct aggcatcggg 
gcaggcacag atttctcttt ccaggagctc 
actccttggg caggcatttt gggcccctgg 
ctgctgtgac caggtctgct gtgcactcag 
aggcgcagct cactgggctg gaaacagtgc 
tgtggtggct tctctgctgc tgcagggctg 
gctcctatcc ctgggtctcc gcctgtgact 
tcctggtgca cccgggcagc ctcccatccc 
ctgctcacac acgtcttgca catgtctgtg 
gcaggtgccc cgtgctgcca ttaggatgcg 
tgggtgactc tggaacacct cctccatccg 
cggtgtgcac ttttctggtg gggccgcctt 
tgcctttgag ggcggagctg gaggtggggg 
acgctgtact cctttagggc tcctggaccc 
cccaaggact gccaggggca ggaaagtcca 
cccagcctgc ctaggcactt ggttcctttg 
gttggcacac tcatggcccc acagagtgtc 
ccatggcaga agggactttg tagatgtgga 
tggattatct gggggcgcac agtcaccaca 
cagaggaggc catgtgaggg tggggttaga 
ccgcaagcca aggagcacag gagcctcagg 
cttcccggag cctgcagaag gaacagccct 
ccaggctgga cttctgaccc cagaactgta 
tgtgttggtg gtgagccatg ggaaacaaat 
ccctgtgtgt ggcctcgctc ccacccccac 
ggcagggagg cctgtgggtg gggcagcccg 
cacaaggagg cccactccgg aggccgaggg 
ttcggtgcct ggacaggggc cggggctgga 
cagtagtttg ctccgggctg gaaatcggcc 
aaagaaaaat tgatgatggg ggatttgcat 
gctcggcaca ctgcagttga cctgaatgtg 
atggagtcgt ggggcgtgtg atcatcattt 
gcagggagcc gtgtgctcag agataggaag 
cagctggttt cgtttactta gctgtctttg 
ggaagcctca tggctcttcc ggaagctggg 
ccctgtctgc aaggctgagg gggccgtgcc 
gggctctgag cagcgagcat ggcggggctc 
agcagtgaag atggccgggc tctggcgttc 
ccccaggagg tccagggtgg aaagaggggc 
ccaggcttgt gagtgtgctt tttcacacag 
gagggacccg tgtccgcatc catccacatc 
tcaagctttg gccagtgctc tgccgcgtgg 
gcttagtgac tccaggaagg ggcgtagggg 
gttacccgtg cagggagggg ccaaagtgcc 
gctgccttga ccttgacagg gccaagatca 
acctgggaaa gtacctgagc cttcccgcct 
agcaagcctg cccagtgtga cttgtctcac 
cacaacatga agacagaggc atgagatgga 
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5821 aacaccccgt ggctctctct gaaccctccc cagctggctc cgaatttggg actgagccag 
5881 gcggacgctc cctttgggga gaaacaccgg ctgacggggc atttggttcc ttccttcgct 
5941 tatccagtga gccttgatta agccccttct gtatgccagc cacaccgaaa gaggggccag 
6001 gaggctgaga atcgggtctg ttggaggtgg ctggggggta gggtgggagg gcagcatcca 
6061 tctcaggtgg gccaagatgc ccaggctctc ctggctgtca gtgggcgtgt gacccctcgg 
6121 aggaagggtc cccaggctcc tcctgccttg gggcatggac ccgcccacca agggcttcag 
6181 ggccagcccg ggctcacgca gtgggttcac atgctccatg ctgctcttgg ggacccaccc 
6241 aggcaccctc tcagggcagg gtgttgtccc ctctgttcat ggtggtgtcc cggcgccttg 
6301 gcagagcccg gcgtgtgatg attacctgag tatttctgac gaaggtgtgc cagagccggc 
6361 cctcaagtgg ctccacgagg gcggggggct gggggctcgt gccacatgca aggttggcaa 
6421 ggggctgggt ggaggtccct gcagggggtg gggtttgtgg actgatcccc gtgttagggc 
6481 tgtgacaccg aattcccaag tccccgcctg cctccactcc gtgcacggcc cgctggtcag 
6541 cacatctgcc ctccctgcta gacgcactac gtgagaactg gccaagctcc cctcccaacc 
6601 actctccctt ccccacatct tgctggtttc tccttgggcc aggctgggtc ttggggccag 
6661 aacagatggt gacaccgggg tggtccctcc tccggaacct gtggggtgtg ttgggcacaa 
6721 gtcagggagg cctgcaggta atggaggctt ggaaacgtca cactgtacca ggaggttgtg 
6781 tgccatttct gggaatcccc acccagcaca cggagtggga atcatgtccc caagaggaaa 
6841 gagcaggacc cgtgagagaa gatgacagaa ggctacgcca gatggggggg ctgttgggga 
6901 agtcactcgg ggaagtggta ttggggtcag gactgaggct gagggacagc ttaccctgga 
6961 gggtggaaga gaagcttcct cgggagcagg gccagcacat gcaaaagtcc tggggcaggc 
7021 ccgaccctcg gcaaatctga atattttctg gacatttgag gacctaaaag caggtctctg 
7081 tggcttaagc ccaaaaagca aagatgaggg gtggtgtgga gggtgctgag gctgggagtg 
7141 ggtgggtggg ggcctcggga gggagcttca agctgggaag ttggattcat tgccagcccg 
7201 gtgaggagtt ctaagcggga aagcaacaag ttctggcttg tgctttacaa gcagccttgg 
7261 acctcagctt ggagccggac gggagggtgt gccaggagct gggaggccgg ttaagaacat 
7321 ggggatgggg ctggggcagc tgagagacca gcagagtgga tggatctggg atcttggaag 
7381 aaagaacccc agatgtgtag ggggtgaagg gcccccgccc aaatgtggct ggcactgctc 
7441 ccctctgctg taaaggagcc agccggccgc ccagagaggg ctggggggta gctccctcag 
7501 ccctggagac gtgtccccca cagccctggg tcaggggccc tgcagctcct gccacctcct 
7561 tcctgctgct ctgaactgga gccccaggcc gggctggacc ctgcccctaa ttctgaccca 
7621 gaagcctccc caaggcccta ggcagaggcc acctgcctcc cttctgaccc cgcctttgct 
7681 cagctccggg actgagcact tggtaccttg cagcctgtcc tgagccctgg ggaggcaccg 
7741 gcttcagcta acgctctgcc tggccatgtc agcgggtcgg ggacctccgt ccgcggagaa 
7801 aagtagccgt gccagggcca gacgcttctc cccagaacgg aggcggggcg gggaggggga 
7861 gctctggagg gaaccagcct gatgagactg ggcagcccgc agggacaggc ccagtgcccc 
7921 ccaccccagc ctccacgggg agggtcttct ccaggccagc tcagctgccc tggggggctc 
7981 tcgggagtcc aggatgctga aggacactga aaccatgctt ggagaagggc cgatcccagc 
8041 qggccgggga gcagggcaga gctccagcac ccacactggg caagtggatc ctggcccttg 
8101 gtaaattcga atattttctc tctggttagt aacagcgagt aacagcatcc agcaccccca 
8161 gtgggagtgg ggctggtgag atgcaggtgc ctgtctcttc ccttttgggt accgtgagcc 
8221 tgtggctgcc gggcaggaag gtgtgctgac ctttggcttc tgggtttact gaccctagtc 
8281 cacccgcggc tgctccctgc ctgggcacgg ccccctgctg tttctgtacg gctactcctg 
8341 gacattggct gggactgggg cctgaggctg tcatactgcc cttggcccca gttttggagt 
8401 cttaggtatg gccctgtggc cccaagtcat gtggtgaggt gtccaggaca ggaaaggaag 
8461 ctactggggt cactcccacc ctacccagaa agaaacttcc aggcttagaa gcctgccttc 
8521 agccttgggc tgctggccag acccccaagg gttcctgcct cacgctggtg gggtctggct 
8581 gctggccctg cccaagaacc tgggtgaatt catcagccac gttcctttgt ctcaggaata 
8641 tctcctggga ctccagcaaa cgctcttctc ctgctgccat tgccatttta taaggtctca 
8701 gcgtgtccgc cccatggtca gtgtggtctg tgcagttgaa gtcgaggctg actgatctgg 
8761 accctcagcc accataggcc acagccagcc ctcagccacc atgggccaca gccagccctc 
8821 agccaccata ggccacagcc aggcagcctt ctggggagag acagcactta ggtgaccctc 
8881 agctgggttc tgcaagccac tgctgccccg taaaccagca ttgttccacc tgaacttaac 
8941 cgtgaccttg gccctggggc cacctgcctg agcctgtcaa ggagacaggg ttgggctggg 
9001 caggggacac cacacagagg agggtggatg ggctggggtg gggagcgcct gggcagggtc 
9061 ttgtcagtgg cccaggcctc tgagaaggac cgaggacctg tgggcgttca caggtgccag 
9121 ctctctgctc tcatcggtct gttgctcttg ttcttcatgc gtggagcctg tgccaggccc 
9181 tgggagggca gtgggggaga agcaagggcc tcaggcttgc ccttaggtga cccagcggtg 
9241 ttggttttag gaggaagagg cagggatggg gcagaggaag cgggagagag gccgggtccc 
9301 cgtctctcct gtctttaagt ggtagaagtg tcctgactgc cagaggctgc ctgcactccc 
9361 tggctcctgg ccccttccat cctcagtctt ccagtctctc cgacctctga cctctgaccc 
9421 tgcctcctcc ctctgcctgg atgctctttc ccagggtctg tggcctcgct gtgttccctc 
9481 caggccttga ctcagaactc accttctccc gaggctttcc cagagcagcc ctgtgtccag 
9541 cctgccccgt gctcttctca ggccccctgg ccagcttgag ctgtgggctg tccccgctgg 
9601 cccagggcct ttgctctgct gccgacgcgt cccagaacct cacccagcac atggaggcgc 
9661 tgggtgggtg tggtgggctg gccaggtgtg gtcagcgtcc tttccgcagg tgtcctcagc 
9721 gtggcggtgc ttggtgaaat ggcttttgtg ttctgtgttg cactgttatg aagcatagtg 
9781 aggaaattca cacccttgct tcacccacct cttgccccag tcatgtaaga ggagccccgt 
9841 tactagctcc tgcccaggag gaagggctgg atggggacca gttcatgtcc tgctgtgagc 
9901 gccccgcacc ctggaggaag gtcaggtgct caggtccgct tacggagcag agacccctct 
9961 gctgtttctg agtcaggcac ccctgcccca gcatcccaga gtcggggtcg cgtggacatg 
10021 agcgatggtc acctggccgt tctctccccc atagaggagg agctccggaa gctgcgagaa 
10081 gaaaccaacg cggagatgct gcggcaggag ctggaccgcg agcggcagcg gcggatggag 
10141 ctggagcaga aggtgcagga ggtgctgaag gccaggtacc gcgccttcct cggggccctg . 
10201 gcctgtgtgc ggctgcccca accccccacc ccacccgccc cgccgtcctc tgctgcaccc 
10261 atagcgtctt ggcagcatca ttcttggccc tggagaggcc tgagtgtgag ggtgggcccg 
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10321 gctctgactg cccaggtata ggtcctgcct 
10381 tgccatcttc ctgtgcctct ggcggtctgg 
10441 ccaggcctgg ggccaggcca actgaagaga 
10501 gccacctcat ggctctgatc ccggcctgct 
10561 ccccaccccg ggggccagcc cggcctctta 
10621 tgcctgtgcg tgttcccatg ccccctgtct 
10681 tgcccccatc ctagcagtat catgtcctgg 
10741 catgtctgca cctgacatct ttgctcttgt 
10801 cacagtctcc tccctcaccc tgccctgcgc 
10861 ttcaggaccc cagcgcaagt tacaaagcca 
10921 ctccgtgtcc ctctgcacaa gttcaaactg 
10981 gccactcacc tctggctccc aggcctcccg 
11041 cggcctccct ctcagcctcc actccagcag 
11101 actccctcag cctcccccct cagtctccac 
11161 tcatcttcag cctccgtcct cagactccac 
11221 agcctccctc cagcctccac ttagcctccc 
11281 cattggccac acctttgtct gtggccccca 
11341 ggaggcacac ggcatctctg ggacacccca 
11401 gggccgtgtg tcattgtcag gactccaagt 
11461 cccccttcag caccctcagg tttagtgtgt 
11521 ccacttcttc cttgcccgcc tcctgcaggt 
11581 ctcatgcggg ggctcatgtg ggggcatcca 
11641 tcctattccg ccctggcaga acctcccaga 
11701 gctttacctg atgctgaacc ttctgggaaa 
11761 ttaattatgt agagttgcaa cagagacctg 
11821 gtgcggtccc caggcatttg gccacagtgc 
11881 cggtggctaa ggtggcctga ggctgtcccc' 
11941 tcctctcact cccgccagcc ccactccagg 
12001 tggtggcagg tgtgcccagc ccgggtcagc 
12061 agtctctgcc ttgggaatgg gtggggaatg 
12121 aggaagtgga agctgaaggt cctgggagca 
12181 cggcgcttag gaaccttctg gaagcctgaa 
12241 gcagagtgac ctcctctttc cttgtcgtcc 
12301 tgtggtccgt ccagggtggg taggtgtgtg 
12361 cgcacacacc gcacacacac ataccacaca 
12421 ttctgtttat gcaaacctct attttggttt 
12481 ttaaggctac ttgtcagtag gcagagattc 
12541 acctcactgt ctgcttcctg cccgctgctt 
12601 cgccctctgt tctcttctca gggtttactc 
12661 cttcccaaga gttaggaagg ttaggggcga 
12721 cacctgctca caggccttgg ctggaggggc 
12781 ccgtctagtg cccctgccca gtccaggcag 
12841 gtacaggctg ctgacaggtg gcacccagcc 
12901 tgtgtgttga tcagtatctt ttcattagca 
12961 tttaatgatg ttgaaagaag caagtgtgct 
13021 agatgctggt tgtttaaaaa caaaacgtgt 
13081 tggatacgct cggaaccagc atttcagcag 
13141 tgacctctcc tgtgcagtac acggtcgcga 
13201 agtctggaac taacagacct cttgtctctt 
13261 gatggctcag cagcccccaa aagggcaggc 
13321 gtgggcgggt ggacaggcag gcgggcgggt 
13381 aggcccaggt taacaatgga gcagataccc 
13441 gctggttctg tgcccaggca cctgctggca 
13501 tagtgtctct gatgcgaagt ggtgagtcat 
13561 caactaaagc gtgattgtgt gttctttggg 
13621 cccacggaac agtgatgtga gagccacgct 
13681 aaggggacca agtgggcagt ggggctatgg 
13741 gtgccctttt atctctctcc tcgtcctcca 
13801 acgccctgca ggcagccggc atagttccct 
13861 gcctggcttc acagatttaa atggtttcat 
13921 aaatttcagt ccataaataa agtccatggg 
13981 ggacgagtgt gccgatctca cctggaccct 
14 041 gggcgggaag aagccgcagg agcctgccct 
14101 gggtacacag acctcagccc ccttggcctg 
14161 gccccctggg tatggtgacc acaggccctc 
14 221 agcacccgct gcaggcggct cctcactggc 
14281 tgagcccccc ttccgaggga ttcttccggt 
1434 1 gttcgccttt ggtaaacatt gctggtcatc 
144 01 ctggctgcgc tcgccaggca cccccagccc 
144 61 ggctgatgct caggatatgc agggctgggg 
14 521 ccggaggtag aaagggggtt tgatgcagac 
14 581 caggctgccc gctgacgaga gcccagggcg 
14641 agagagaccc ggcactgagg tcgtcatggc 
14701 cgcacttggt gtcttcagag gactgagccc 
14761 ttccctgacc tacccccgag cattccttgt 



gtgcctgcat gagctgtggt cttgggaatg 
aaaagggtgc acaggtgctc ctgccagccc 
ggcttggcat ggggcctgca gagccccaga 
ggtgacccca ttctgtacat ccacccagtg 
ggatcttccc aggacctgcc ttcccagtgc 
gccccagcca cgctgtggcc cccgcccccc 
gcgccagtgc cgccctggct cctagagagg 
ttgtgggtcc catgtgggac cctgagtgct 
ctcacagtgc cgccagcagg aagcaggtca 
gtgccctcct gtctcgctaa caccacgagg 
gtgcctggta catgaggcca gggagctcag 
ctcagcctcc actccagcaa cctcctgctt 
cctcctgctt cggcctccct ctcagcctcc 
cctcagtttc tgcccagcct ccccccagca 
tccctcagcc tccccactca ggttccactc 
tccaggtctc ccacctcagc ctcccaccac 
tgacctgcat ggatggctgt caccagttag 
gctgagcact gcctgctatg agcgctccgg 
gtcgctgctt agctggccct gaggaagcgg 
gggaagacgg gggatccgct gggctgactc 
cagcatgacc ttgtggagtg ccctgggggg 
gctggtgtgg gggagggaga ggcctcactt 
ccagaaatgg gtgcatgggc ctcggcaggc 
atccacagaa gctcaaagct taaattacat 
acctgccacg ctgggagatc ccagggacag 
catccccgcc acgggacgct gcttctcccc 
gtggccctgc tccctgtgtt ctgtttccca 
atctggtcct catcttttac caaagcccct 
tgtaagaacg cagggagccc catctcgcac 
ggcagtgaag gaacatccag gctgtgaccc 
ttaagggctc tcccctgctt tacatggggg 
ggaggagggg tgaaggaaga gcagtcccag 
tggtgcagat ctggggtggg tctgggtggc 
agggcacaca caaacatgca gacacatgtg 
ctctgaagct gcaggattaa agcacagcag 
ttaagaaaat tctcagaaag tcaccttggt 
tctccattgt ggatccaaag tcccctgaaa 
acctggccgt gccctgtgtg ccctgtgctc 
ctcccccagc ttcctccctc cccactcccg 
ccacacagca cttggcccgg ttctgtgctg 
tgggtgtggg acggaagcag accctagtgg 
gtcgctggag taggctgagc tgggagctgg 
tgtggcctgg cttggtgcct gataaacaca 
gacgggttgg tgatatgaga gtattctgag 
aaaatgaata ggaaaggaaa gagtggtgaa 
gcctgttagg aaggcagctc tgtgcttgcc 
tggagactct ggagagagcc ctgctgtccc 
gagccctcgg gggtgcctga ctgggcatgg 
ctctccctgc tttttaagaa ccgaggagca 
ccaggccagc aatggagcag gtacaccctg 
gggtggctgc gcgtcacctg ctggttctgt 
cctcccccag caggtgggca ggcggcaggt 
tccgggaatc tcctcccgtc agaactgggt 
catagtcgtc atcacaaatt tccagtcact 
gaatacacac gtactctgtt tcagccttgc 
ggtcactgcc gctgccctgg tgcctccccg 
ttggctgtga gtggggagtt gggggttcat 
tccctgagtg cctgttcttt atcttgcccc 
gtacctgggg gtcagcaaac tccagccccc 
ttcatgacat gagaacgtaa tataaacctg 
aacacagccg ggccggttca cttacggctg 
ccccttaaaa actctgccag gctgccttgg 
ggccccatcc tccctgcccg ggtggccttg 
tggctcttga gcctctgccc tgagcctgca 
tggtcccagg gcccaaaact ttggcgaggc 
cttctgaagg gagggactgg cttgcacccc 
ttcccgccgt ggaggctgct ttccagaggt 
agcacgacaa caccgtgcgc tcaccctggg 
aggacacagc ctcccgctgc gccatccatt 
cagtgacctc tccatctgct ttctgacaac 
gtctgaggtg ggccgagcag cctggatgga 
ggcagagagc tggggacagg gatgaaaagc 
tggggcagct ctgctgtgtg ccggttggtt 
ttcaaagtcc cttgtacagg gacccttcat 
gcctctgctg gccgcgccct tgcagtgagg 
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14821 cccctccctg ccctgagatc ccggagccga ccccagaaca ggcagtgcag ccctgccgtg 
14881 ccctcaccgg gtccctgccc cctgcagagc gccggagcca ggggctgtcc tcgcgcctgc 
14 941 agaagcggtt ctacgagcgg tttggggagt acgtggagga cttccggttc cagcccgagg 
15001 agaacactgt ggagacagag gaacccctga gcgcccgcag gtaggggttc gccgagctgg 
15061 ggctgcctgt gcgttagggg ccccggtccc tgggctgagg ctgcagcggg tgggccatcc 
15121 tetatcaccg tgggcdaaag ctgtctgtga ggctgccgag agctcctgga ggctccgctg 
15181 tgtctggtct tctgggggcc gtgcctggca ggggcttgcg aggcccgctt aggtgcagca 
15241 gagctgcatg cgcgcactgc tctcagctgt ggcttgtgac agttgtgtgg 99«<*catgg 
15301 ggctcagccc actgcctggg gccttccttg gctcttgagt cccatcagta aaccttggtt 
15361 cccctgccag gtgctgccat aggccctggg ggtcccgagg gagctaggtc a9c«aggcc 
15421 ctgcccttga ggacttctgt cctggggaca caggcaggag atgaggggac gatttcattg 
15481 gcgagtcgcc cctgccttcc taacttgggg agtggcactg agggccatcc ^cgcaccc 
15541 ?gctgtgctg agccgccgtg catggcoatt ctgtgcactc aaggggtccc aggctgttga 
15601 gacgtcctcc ctggcccctg cagagccaac ggaaccgggc agtggggact gcagagccgg 
'5661 cgggtgcatt aaccctgggc tcccatccct ccccttctct cttaggttaa ctgaaaatat 
^5721 gagacggctc agtgagtacc agcggctctg cgtggcgccc acgatgctca gtgtgggtgg 
15781 ggcccatggc ggtggggact ctgggattct gagctgaacc gactctccag ggagacgtgg 
15841 tLagcggcc tcctacccac agtgctgacc actgtggtct gctctgtccc cagagcgcgg 
15901 tgccaagccg gtcactaact ttgtgaagaa cctctctgcc ttatccgact ggtactccgt 
15961 ctacacgtct gccattgcct tcaccgtgag tgggtcctcc aggggccgag cagggtgggt 
16021 gggacgtgcc tgccagcatt caccacatga tcaggcacct tcccagggtg ggaggtggcc 
16081 cccaggtggc acttcctcag gccccaggga catgctcatt tctccttctg taaagtgggc 
16141 tcactgatgc cagtgcccag ccatggggtg gcttcaagga tcaaagacgt tgggcgtggc 
16201 ccggggctag cctgcagggc tccctggaca gggatgtggt gcagccttgt ^=" a 999t 
16261 gcccttggga aaggcctctt ggcactcacc gtgcagtgcg gtggccacgc tgctgaggtc 
16321 cacagagtac ctggtgctgg ggccaggagt acctccaagg gcatggcagc acgaggggcc 
16381 cagcccagcc cctgtgagcc atgagtcctg agggattgga gggtgccagg gcccacattg 
16441 ccagggcctt ggggaaaagg tgcccagccg tcctgcctgg aactgccctg cagcggctgc 
16501 ccclgcggga caggagactt gccacctgtg tctcagtact tggtcctcgg ggggagcccc 
16561 ctcacccagg atggagactg tgcctccagc acgaggaaga tgcaggcagc "^^gctt 
16621 gctagggcct ccaggccaga gaggaggcgt ccacagcctg gcactgtcca gggctcccaa 
16681 aggcagcact gccctgctga gcgggcccct tgggagaggt ccccagatga "tcagcctg 
16741 ggtcaggctc tccaggaaag tccctgaagc ctgtggccct gcgcctgctg cgggcactta 
16801 ccactcacct gggacatggg cactttgttc actcacagat gaaggctggg tgctgggacc 
16861 accggcttca ccctggccct ggaaggcctc ttctcctgcc accctcacaa atgctttgog 
16921 acaggaggcc tttgctgcct gggct.cccac acgctcctgt gccatgtctg "gtcactaa 
16981 gagatgagcc caagctttga acctgccttc caggcctcac ctccatcccc tgccctgtgc 
17041 acatccgctc tgcgtcccgg gctctgtgct gggcacagat ggtgcctggc atgaaggtgg 
17101 ccccgggcat gcacctggct cgccctcagc agtgcttgcc tgcctcgccc agccaggctg 
17161 agtcctggcg atgccccaag tctggtgcgt gggagccaca ccggggcagg caggacccgc 
17221 atgtggaacc agccgccctc ttctgtcccc aggacacgct tctccctagg tgtctcctgg 
17281 cttccaccgg cggctcttaa accggggaag cagcccttcc tccggcttcc ccatcagacg 
17341 caccctcatc cctgagtgtc tggagagcca gcgggtggcc tggtctcccc actggggccg 
17401 cctggtgggg ggaggcggga gggatgtgcc ttgtcctgat ccaagctgtc aggaggtctg 
17461 acgccctgtc tctcacccgc aggtgtacat gaatgccgtg tggcatggct gggccatccc 
17521 attgtcctta tttctagcaa tcctgaggtt atccctcaat tacctcatcg ccaggtaggt 
17581 gagccggttt gttgcagctg ctttatccca ccgcatgtat ggctgtgctg tctccctgcg 
17641 tctcacaacc gtgaccccag ctgggtatgt catgcactga cgttttgggg gtggtcccct 
17701 gacaggctcc agagaccccc caggcagggc ggctgccccc tccctggttt gctagagatg 
17761 gcagtgggaa gttcccaaat cccccagcag ccccttcccc gggcggccag cagagccagg 
17821 gccgttccag cacctgcccg aacatcgcag ggggcttctc cccaacatca gtgggtaggt 
17881 gggtgaagca aacagggctc cacgccccgg aaggttcctg ctggctgcct gttggagatg 
17941 agcctgctgg ggaggagctg gaccgggaat gagtaccagt cgcctgcaga cagcacgccc 
18001 gtgtgaccga cgcgttaacc ccaaccccgg cctgagtcag ggocctogtg tcacccctta 
18061 tgccccctct gagcaccccc tccctgtgga ggctgagggg ggctctggga agcccatgcc 
18121 ctggggtgga cccctagtcc ctgcactcct gggtccttct tagcctgcgg ttacccctgg 
18181 agtgaacccc agccttgcac cgaggctcag ggaagagctg tgattctgta ctttgacttg 
18241 gccgttcaaa agtttgcact tttatgcacc agtgtttcca gatggttctc tgtgcctgcc 
18301 ccagttggct catttgaaaa tgttttttgt agaaatcact taaatactct 99«39tgtc 
18361 gttggtaccc agagctaggt caggatcagc ttggacactt gctgatgcca ccttggatgt 
18421 tgaagggccg ccctctccca caccgctggc cacttttaaa tatgtcccct ctgcccagaa 
184 81 gggccccaga ggaggggctg gtgagggtga caggagttga ctgotcccac agcagggggt 
18541 tccggaggga ccttttctcc ccattgggca gcatagaagg acctagaagg gccccctcca 
18601 agcccagctg ggcgtgcagg gccagcgatt cgatgccttc ccctgactca ggtggcgctg 
18661 tcctaaaggt gtgtgtgttt tctgttcgcc agggggtggc ggatacagtg gagcatcgtg 
18721 cccgaagtgt ctgagcccgt ggtaagtocc tggagggtgc acggtctcct ccgactgtct 
18781 ccatcacgtc aggcctcaca gcctgtaggc accgcCcggg gaagcctctg gatgaggcca 
18841 tgtggccatc cccctggagt cctggcctgg cctgaagagg aggggaggag gaggccagcc 
18901 cctccctagc cccaaggcct gcgaggctgc aagcccggcc ccacattcta gtccaggctt 
18961 ggctgegcaa gaagcagatt gcctggccct ggccaggctt cccagctagg atgcggtatg 
19021 gcaggggtgg gggacattga ggggctgctg tagcccccac aacctcccca ggtagggtgg 
19081 tgaacagtag gctggacaag tggacctgtt cccatctgag attcaagagc ccacctctcg 
19141 gaggttgcag tgagccgaga tccctccact gcactccagc ctgggcaaca gagcaagact 
19201 ctgtctcaaa aaaacagaac aacgacaaca aaaaacccac ctctggccca ctgcctaact 
19261 ctgtaaataa agttttattg gcacatagac acacccattc atttacatac. tgctgcggct 
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gcttttgcat 
actgtctggc 
cactgggcgt 
tgcctgtttc 
tgcccgccca 
caggaccagg 
gctgtgaagg 
ccactcctgt 
ggggtgtcct 
acagcagagt 
tgttaaattt 
cctgactgtg 
gccacgggcg 
ctacgccggc 
tgacactgca 
ggtcgttaca 
gtgcgcgagg 
gggaaggact 
agagctgggc 
tcagggctgc 
gggggctgtt 
atcctggaga 
ggcctgagcc 
aagctgtatg 
gtggggcttg 
caggtgtccg 
gcgatccttc 
gagatcgagg 
catgtctcct 
ggcgccaggg 
gacagtcatg 
gagtttggtg 
ctggcggcgg 
gggtgtggtg 
cttccctcct 
ttctttgttt 
tcgcccaggc 
tcaagtgatt 
gcccagctaa 
ggtctggaac 
acaggcggga 
tcttttcttt 
gcatgtggga 
cccacgcagg 
gtgccggagt 
agaggagcag 
aatgccttta 
aggacaggcc 
tcaggtggca 
gccctggagc 
ctgtgacctc 
tgcacttgcg 
tgcacctggg 
tcagtctgtc 
tcccctgagt 
gcaccaccac 
ccaggctggt 
gattccaggc 
ggtttgggtt 
caggtgtggg 
tgacaaggga 
gctgggactt 
aggcctggga 
cctagcatct 
agcactttgg 
gccaacatgg 
gcagtgagct 
caaataaaaa 
gtagtcccag 
tggcagtgag 
ctcaaaaata 
tggtaggctc 
gcagtgagct 
caaaaaaaag 
ttctgtgaaa 



tacccttgag 
cctttacaga 
gggcagaagg 
ttgcatggag 
gcactgggac 
gctcccattg 
tgaactgtga 
tgtgcaccca 
cctggtgcca 
ccctggggct 
cagtggaaaa 
tctgagaagt 
ccggccaggg 
caggggtagg 
ggcccggttt 
gcaactctgg 
gtggagctcc 
caggcccggg 
cttccccggc 
ccgacctgga 
gtaattgctg 
agatcaagaa 
tcccgtctcc 
tggcgctctg 
ccgtgggtaa 
catgccacca 
ctctgctttt 
gacacggtga 
gcaaagtggg 
aattgccctg 
gagagcctga 
acatcacgtg 
ggaaggaggg 
gccccccctc 
ttctcttttc- 
ttgtctctct 
tggagtgcag 
ctcctgcctc 
ttttttttgt 
tcctgacctc 
gccactgcga 
ctcttttcct 
gcgaggccgc 
acagcgtgtg 
cgggaggtca 
tgggggctgt 
ccataacgca 
tctctggacg 
tcaccaggag 
ccctctggct 
ctgttggacc 
gccctggaca 
tcttggtgtc 
tcacccaggc 
ttaagtgatt 
actcagctaa 
attgaacccc 
gtgagccact 
ctcacaacca 
ccaagatggc 
cctcagctta 
gggctctctt 
gctcagtcca 
ttcaagaatc 
aaggccgagg 
tgaaaccccg 
gagatcgcgc 
aattaaaatt 
ctactcaggg 
ccaagatcat 
aaataaaata 
ccagctactc 
gagatcaagc 
aatcaacaag 
atgtcctaca 



tagacgacag 
agtctgctct 
gaggcctcgg 
caccaggggt 
ccaggagacc 
gctgggcctc 
ggcagagcct 
gagtgagggg 
ggtccccaag 

gggagggctc 

ttctctttcc 
tccagctggt 
gtgtgtctgc 
tctccgccgg 
gtccgcggtc 
tggttgggga 
cctccctccc 
tctgctgttg 
ctaggagcag 
ggccctcctg 
tttctccttc 
gtaagtcccg 
ttccagcttg 
ggctgccttc 
gtagatgcac 
gtgtctgcca 
tgtcacaaat 
ttgtttctta 
aggagctgga 
ggcaggacat 
gagcctggtt 
aagccaaggg 
tggctggcgg 
accccaggag 
ttttttcttg 
ctctctcttt 
tggcgtgatc 
agcctcccaa 
atttttagta 
aggcgatctg 
ctggtctctt 
caataacaac 
cagggtcctg 
cggctgtagg 
gaccccggcc 
gtctgtgtgg 
agcccgaccc 
cagcacggct 
tgggcagagc 
agcacccagg 
cctaggcctg 
gctccggggc 
ctgcatctgc 
tggagtgcag 
gtcctgcctc 
tttttgtatt 
tgacatcatg 
gcgcccggcc 
ccctgagatc 
attcctggct 
gaggggcaga 
gagctgagct 
gtcagccaag 
aacaagttgt 
cgggtgaatc 
tctctaccaa 
cactgcactc 
aaaattaaaa 
ggctgaagca 
gttgctgcac 
aaataaaata 
gagaggctga 
cattgcactc 
ttgagaattg 
aaggcaaaaa 



accacgtggc 
agagggagac 
tgcctccacg 
cagggcaagt 
ttgagaggct 
agtttccctg 
ctgctcagcc 
cacggggtga 
ggatgtccca 
cacactgttt 
ccttttactg 
gctggacgtc 
gccagccatg 
cctccgctgc 
agctgacttg 
aggggcctcc 
caccgcccct 
ctgtgagcgt 
ccgggcagga 
gcgtcgcggt 
acacagaacc 
ccccccaccc 
ttcatgtggg 
ctggcctcct 
ctgcggcctg 
ggtgtctgca 
gtttcaattt 
aatcaagaag 
agacccaaga 
gcctcttggg 
tgatgtgcag 
tctcatccac 
tgaggaccgg 
gcgctcatcc 
cttccctcct 
tttttttttt 
tcggctcact 
gtagctggga 
gagacggggt 
cccacctcgg 
ctgttttgtt 
gatccagtgt 
tccttgaggg 
agccacaaag 
cctcagcagt 
aagagcagat 
tagccctgac 
gagctgtcca 
cagatctgat 
actctaggct 
ggtccaggtg 
aggccgccct 
cctgctgtcc 
tggcacgatc 
agcctcggga 
tttaatagag 
atccgcccgc 
cctgctgtcc 
accactgcca 
tatcggtgcc 
ggctgcctgc 
gtccctcctg 
ggggtggtcc 
ccggacacgg 
acttgaggtc 
aaatacaaaa 
cagcctgggc 
aaaattagcc 
ggagaatggc 
tcctgcctgg 
aaataaaata 
ggcaggagaa 
cagcctgggc 
gggtcaaaaa 
atgaaacaaa 



catggaagcc 
cccggcccat 
ggcctagttg 
acctggagga 
cttaacgaat 
cctgtaagtg 
attgcagggg 
gatgtcacca 
tcccccctgg 
tgtcagtggt 
aaggaacctc 
gcccagaaag 
ggcaccagcc 
tgcctgggga 
tagtcaccct 
tgattcagcc 
ggccagggtt 
ggccacctct 
ccacagggct 
gtgactgaca 
ttttcgggaa 
cggccctgcg 
tccagccgga 
gcttcttccc 
tagcttcagg 
agagctgtta 
gggagaagtg 
ccctccactt 
cccaggatag 
gcgcctgtgg 
ggaggttggt 
taaaacactc 
gctgtgccgg 
actgctgccc 
ttatctttgt 
ggagacggag 
gcaacctccg 
ctacaggcac 
ttcaccttat 
cctcccaaag 
ttgtcttgtc 
ctgacatggg 
gcaggtggga 
ccaggggtgg 
ctaggcttgg 
gccccctgcc 
ccagcgtaac 
acagcagctc 
cagaccctta 
catgaatggg 
ccgaggctcc 
ccacggggag 
ttttttttca 
tcggctcact 
gtagctggga 
acgaggtttc 
tcggcctccc 
ttttttgtga 
ctgggcagcg 
catgtattct 
tccaagcaag 
ggtcggccac 
tggccctgcc 
tggctcacgc 
aggagttcga 
tgtagccgcg 
gacagagcaa 
aggcatggtg 
gtgaacccgg 
gcgacagagc 
aaaaatttag 
tcgcttgaac 
aacagagcaa 
gtctgacttg 
ggcacagctt 



aaaaatattt 
ggggcaggac 
ggtatctcag 
ggcaggctgt 
gggagacaag 
agggagggca 
cggctctgcc 
tcagcccata 
ctgtgtgggg 
ttttctgaac 
caaaggaaga 
cccaggtact 
aggggtgtgt 

gggccgtgcc 
gcccttggat 
tctgcggacg 
gaacgcccct 
gccctagacc 
ccgagtgacc 
gcccaggagc 
gatggctgac 
gcgcccgccc 
gatcacacag 
ctaccgcctg 
aggagggaca 
aactcacctg 
aagcccatcc 
ccaaggcccc 
caggacttgg 
gcttaggggt 
tgtagctgat 
atctgcagag 
gaggtgggtg 
agcatgctct 
ctttctcctt 
tgtcgctctg 
cctcccgggt 
ccgccaccac 
tggctaggct 
tgctgggatt 
ttgtttcttt 
ccacacactg 
ctcgcctgag 
ggagagctcc 
cgtgctgcgc 
acgtttcctg 
tgctggtggg 
actgtggcca 
ccccgacaat 
taggtgacca 
agagctgcag 
cctgggggtg 
agacagcgtc 
gcaacctcca 
ttacaggggt 
accatgttgg 
aaagtgctag 
cacccagtgt 
gtgaccctgc 
tttgcagtgc 
gatggcccca 
cacgtggagg 
cctcctcaca 
ctgtaatccc 
gaccagccca 
tgtggagctt 
gactctgtct 
gcaggcgcct 

gaggcggagc 

aagactccgt 
ccgggtgtgg 
ccgggaggtt 
gactctgtct 
attgatgaat 
tatcagtccc 
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23821 tttatagcac agatagtaca tggaactctg accctggggc ccctgcaggg agggcctgcc 
23881 acgtctggtc tggtctggcg ggctgtgcac tgggggcagc gagacagccc ccagcactgg 
23941 gaggggcacc cggagagacc ggaactcctg acagcttttg ctgcccaggg gggtgcgtta 
24001 gcaggtgtgt gggggccggc cccgcctccc atccctgcat ctgttgccag gtggggtcct 
24 061 ggggcgcagt caggcctggc tcttctgagg tcgtgtttag tagggggact gaccatgatt 
24121 gtgtcaggtg acagtatcgg ggagaaaacg gagctggtgg aggggaactt gcgagggcgt 
24181 ggctgggagc cgtctgagca gctggtcatg ggcggcctct cagcagggat gggctgggcg 
24241 tgggggggca cccagttgaa gaagggcagg tgggaggggg tttggggagg gattgccatg 
24 3 01 gagggacctc agatttggtg tcaggtgtgg ggaattgggg cagaggaggc acagtcctga 
24361 agtgtttcag gtgggtcaag aacagccctg agccaggcgg agttggagag agaagtgagg 
24421 ggcattggat gggggggtcc tagcagagct gaggggtctt agtgggagtg ggtggggcgc 
24481 tggcagtggg gccctcgcct ggggttgaga ctgtgcagca cgaatgggcc ccaggggagc 
24541 gaggctgggg gtatccaggg tgaggactga gcgagcagct ggactctcac atccaggctc 
24601 agggtggagg gtgccaatct gggaggtggg aggtgccgga accatcaccc tgagtagact 
24661 ggcatagctg cagagctcta gatggagcag gctgtgtccc ctgccctcag gactctatgc 
24721 tggtatcaag ttcttcctca ttgatttcat ctttaaacgc tgcccgaggc tgcgcgccaa 
24781 gtacgacacg ccctatatca tctggaggag tctccccacc gacccgcagc tcaaggagcg 
24 841 ctccagcgcc gcagtctcac gcagggtgag cccggccccc agctgcgggg atggggggat 
24 901 ggggggccac gaagccgggc atccccaggc ccacagtgct ctgttcacct ccggctcatt 
24961 tagaggctgt ttcttcaatt ttatagactc cttaaacttg agggtttttt ttttccctgt 
25021 taccactgcg gtttctgagg ctcatccagc ccctcccaag cccagggcaa cacctgggga 
25081 ctggccatga tgactgagag ccagcaggcc tgggagtgac cactccatgg gggtgcccgg 
25141 cccctgccat ccctgacagt gaggctctgg cgctgagacc aggcagtttg agaagggccc 
25201 ccttggagcc acagcctggc actgggccat atttcattca cagtcacagg ccctccagat 
25261 gtgcagcaca gcatcccagc caggaccacg gggaatgagc aggcctcggt gctcccttcg 
25321 gggtggctca gatgagtcag gagggttgct gggctctccc cactgaggtg aggtggccct 
25381 gcctggaaga aggagccgct ctgggccgga agggacctcg ggaacgtctt cccaaatttg 
25441 gatcccagct tttggggagg tgtgtgtgcg gccagcacat ctgggtgctg caggcgtggg 
25501 cagcgggcct gacctcgacg ctgtttgccg ttggcagctg cagacgacct cgtcacggag 
25561 ctacgtaccc agcgcaccgg ccggcctggg taaagaggag gacgccggtc gcttccacag 
25621 caccaagaag ggcaatttcc acgagatctt caatctgaca gaaaacgagc gtccgctggc 
2 5681 gggtatgtgt gccgtgagtc tgaggccagg ccgagcgccg acacagactg aaggcccgtg 
25741 aggggggcgc tgtggatgca ggacggggtc gccctgtgac aggccagagg agccctcagg 
25801 atggcattca catcccactg ccccaggagg agtgcaggag ccaggggtca tgaggactct 
25861 gcgggctaca ggggaggagg gcgtgcccgt ggggagagcc gcccacgctg cagcactccc 
25921 ctgacagcag gcgtgcaggc aacacactgg ggagagagtc cagggcgagg cctcagggcc 
25981 tgcagggcgc tggcaggggg tctgcccctg gttccatcgt gtctggagaa acaagagcca 
26041 ccaggggaca ggtgactcag gatcctgggg acaggtcagc tttcccaaag ggagggcctg 
26101 ttaaactaaa ccgggcagct cgaggtgtgt gatgagcggc tgcagaggca gctgcggaga 
26161 acggagcttc gtcctctgct gagccctggc cctgccggtc ttggtggagg tggtgtccca 
26221 gcgccctgac cctgccggtc ttggaggtgg tgtcccggcc acactgtgac ttctgttgct 
26281 ccctgctctg cctgggtgct tttactttaa aaaccctgcc agtgcctgtt gaagaggaaa 
26341 catctgtttt ccaccctctg gggccacctc actcttgccg gcgcgccttc ctctcactgt 
26401 gggtgtgacg tgagcgcggt gggcggatgt caggatctga gagtcaggct cctgggtccc 
26461 agcgtcaggt cagaggctgg gtacttccag tggaaagtgt gacaggcggg ccttctccca 
26521 ttagtgtgcg agaatggctg gcgctgctgc ctcatcaaca gggaccggaa gatgcccacg 
26581 gactacatca ggaacggggt gctctacgtc acggagaagt gagtgcagcc gtggggccct 
26641 gtgtggctgc aggggagggg gcgcaggagg ctgcctaggc cctgggatgg cgtggctggc 
26701 ccagtgcagg ttttcctgag tggtggccat ggcctctccc atgctctgct cttgccggct 
26761 gtctgggctc ccagggaaaa ggtgttgtct ccaagccgga cgtggcctgg gctgtactgc 
26821 tgccatccct caggctggcg gggatgctgt cttctctggg gcccaagcag ttcatggaga 
26881 cgaggggcag agtctcccaa ggaggggacc gtgtggggac ctgctattgc tgaggccctg 
26941 agtccttgtt gccctgagtg cctcatgcag agcagtggcc gtgagtgtct cactcagagc 
27001 agtggctgtg aatgtcttat gcagagcagt ggccgtgagt gtctcactca gagcagtggc 
27061 cgtgagtgtc tcagagcagt ggccgtgtct cactcagagc agtggccgtg agtgtctcag 
27121 agcagtggct gtgagtgttt cacacagagc agtggccgtg agtgtctcat atagagcagt 
27181 ggccgtgagt gtttcaccgg ttctgtgtct gctgggaagg gagcaccact gtccatccca 
27241 ctgggggctg agaggcagcc ccgggagacc cccacctggt gggagcgtgc tctggttcaa 
27301 gagccaggcg acgcctctgt ccgttcccct tccagttact tgtgcttcga aagctccaaa 
27361 tctgggtcct caaagaggaa caaagtcatc aagctagtgg acatcacgga catccagaag 
27421 gttggtgcac ctacccaccc ccactaaccc ccgtgttttc ttcgtcacct gacagaggct 
27481 ggcgaggctt tccctataga cttctgccag cctctgtcag catctgggcg ccgcggccac 
27541 gctggccggc catggtccgt ttccttgact tgagtcccca tgtccctgtg ggtctgcccc 
27601 tgccctgggg tccacttgcc tggaccgtgt gtctcaaagt ggaggggctg ggttggagcc 
27661 caggaggtca ggacccaagc cccactctgg ctgctgtgtt cccctcgtat ctgggaccct 
27721 ccccataaag aaaagcacag gggccctgag cccctgcctc ggatgcaggt tttcaggtgg 
27781 gagctcagat ggtcgcgccc gctctgggca ttttctcagc attttaccgt agcatggaat 
27841 gagcaggtgg gcgtggccac aagtgcctgt ggtcccttgt tttctggggc agggaaggct 
27901 cagtgaggca ctggcaggca cacttggccc gagtggtcag ggcccccagc tccctcccag 
27961 ctccgagcgt cctgggcatg tccgcattgg cacggcgggc actcccggcc ccacggcagt 
28021 tgtgggtcat ctctcccacc cggctctgga aaaaagcgtg cagtcaggtg accagaaggg 
28081 gtgagttcag ggctggggtc ctttccgcct gcctcttggg attggcggtg ctgacagatg 
2814 1 gccctcctgg tgccaggccc tgagagcgag ttctggggct gagtgcctac cgtcagcctc 
28201 ctcaggccgg cttgtctctg accttcctca ggccagctgg ccgacggtct gggctggggt 
28261 gggacagacc ctttacaggt ctggtgtctc ggccacaaag tgaggaggtg tccacaggcc 
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28321 ctcacctcct gttgtttgga agtcccaggt 
28381 tggaggagga tgccctgggc ctgtggaggg 
28441 tggtgacggc cacgctttgc ctttcagtac 
28501 atggggattg ccgtgtcgac gccatccacc 
28561 aaggggttgg tgtgggccca ggggcctgac 
28621 gacagccaga gttttagttt gggcagcagg 
28681 ctagagcctg gggctgcgtc caggttgggt 
28741 ggacagacag ggatgttggg cacaggggtc 
28801 cacggggcta aagcctggct cccctctgcc 
28861 ctggctccgg gtggtgggga gacacctgtc 
28921 ctggcctggg gctggtgttg gacttcgtct 
28981 gccatgctca ctggtggcat ttcttccctg 
29041 cagggatgag gccttcgaga ccattctcag 
29101 gtctggcggg gacagctagt attgacttgc 
29161 tttctttttc tttttttttt tttacgattt 
29221 gcttttgcaa taattctcct ggacctgtgg 
2 9281 accaagaagg ggccagggct cacagggacg 

2 9341 gtgcctctga gggccacccg cagactgggg 
29401 agaagacaca caggacccct ggccctgccc 
29461 ctccagccac cccaagaggt tctcgcaacc 
29521 cgtccaccac caaagccata gctgaagagt 
29581 tgtcgtcaag gagggacatg ggggcctttc 
29641 ctccttcctc ttacatgaga ccctcctgtg 
29701 ggcgcagtga ccctgcctgc gctccacact 
29761 gcccacccct cgcctgccct cagtgtcttt 
29821 gctcagcacc gcgtctgtaa gggcctgcct 
29881 ggtctgtagc tggggaccag taagggcaca 
29941 acactttgct ctttggaagg cccaggagaa 
30001 gcactgtggc atgtctggca catggccccc 

3 0061 ggaagtgggg aggagtgttc cgggaccatg 
30121 tccgagaggc tctggcagcc cctgtgcttt 
30181 ctgggcctgc actgcctgca gccgacatgc 
30241 ggtgccggtc cgggcacaga cccccccagc 
30301 cccgggaaca aggtggcatt tgtggaggga 
30361 ctgcgctggg ctgaagggag ggaaaggcgg 
30421 cgcgggcctg cgtgtctgct ggggcggatc 
30481 ttgggtgtcg ttgagattgt tgttttttga 
30541 gcaagctggt tttcttattt ttgtatcctt 
30601 tcctcatttg ctggtgtggg tgagggatcc 
30661 gaggggccgc agagtcacac gctggtgccg 
30721 ccagattaag cacatgtcct atcccaggcg 
30781 cgcggacgtt ggcgtcagga tgaccacacg 
3084 1 ctctgagccg tgggtctggc tcctgtaggg 
30901 ggacatatcc cagtgaaccc caccttggcg 
30961 acccccaggg gcggctgcag aggcagtgcc 
31021 ggcccggcag tgcccaagga tgggtccggg 
31081 gtgctgggat tcagtcccca aacacagcgg 
31141 cagcgtcggg tggtttagtt cgagtccctt 
31201 cgcgccaggt gtggccgaag cccccaggga 
31261 gagggccacc gactggctct gctgccagca 
313 21 gagcgcggat cggcacgggc tctgggctcc 
31381 .ttgtgtacat ctgggcagat gtttaatttc 
31441 aaatttttta tgtctgaagc ctgagtctat 
31501 gacttttgtt caacaaaagg attgtactgt 
31561 aacatttttt taagaaaact ttgtttgttt 
31621 ttatttaaat aaacctaaaa tgctttgtgc 
31681 gggccacacg cattgtaggc ggcctgagcc 
31741 ccgctgccag cccgccactc cctggggccc 
31801 ctgatgcggc ctgagtgacg gctgtggggt 
31861 gacgtgggca ttggaggctg ggagcagggg 
31921 caggtggcga gggacgcggc ctgcagggag 
31981 gggaggttga ggcagaggca gtgccttggg 
32041 ccaccctggt gtcctggctg cctgggggtt 
32101 ggcttggtcc atctggatgc tgagactaga 
32161 ttgaaacccc tggagttggg catgagcctg 
32221 ctgtgcctca agcccacagg agcccccttg 
32281 agggagctgg agccaggcgg aaactaggtc 
32341 cttcggaaag taggacatat gcgtactcca 
32401 catccacatc tgtccgtccg tccgtccgtc 
32461 tccacccacc cattcatcca tccacccacc 
32521 cccacccact catccatcaa tccatccacc 
32581 tccacccacc cacccatcca tccacccacc 
32641 tccatccacc cacccaccta tccattcacc 
32701 tccatccatc catccatctt tccacccacc 
32761 tccatccatc catccatcca tccacccatc 



gcccgtgggc cctgctgcct gtgtgcccag 
ctgtgcctcc ctgtccccag gagagacctg 
aaggtcctgt ctgtcctccc aggctcaggc 
cagaaagtag gtgccgggcg gggggccgcc 
tctgagcaac cctgggacca gcgtggggca 
gtcacaaagc cgccccctcc ctcgcagcag 
tggaaatctt gctggggcca tcgagtcacc 
ccctgactca gtgacttggg aggagggagg 
tcttggctct gttcccctga aggctggagc 
ttttgcaccc agacccacgt gactagcgtg 
cggtcctggt cctggcgcca gcctggctgt 
ccagccgctc gtgtttggtg ccatggtgca 
ccagtacatc aagatcacct cagcggcagc 
ccaggacgtt gctggaattt tctttttctt 
ggtagtggaa acaattggac atcctcatga 
ttctattgtg ttgacctctg cgttttatcg 
ggggtgcccc tctcccacag ggcacgtcag 
gaggggg ca 9 aggccctcgg gggcccgtgg 
ttctccgttc cagcctggac agagaaacct 
ttgtgtcccg ctctccagag gccagaagct 
gcggggccct tcctcctggg gacagaaaga 
accaaccacc gagaaacggg cctggcggcc 
gcatttgccc ttggtgccgg gctggggccg 
cgctccacgg gaacagagag ggtgagaagg 
ggtggcacct tccttgctgg cctccagggc 
gctgctctcg gcctgacacg ccggccagga 
ggatggtgca ggtaaaagca catctttctc 
catccgcgaa ggctgttgga ggtgctccga 
aggctgcggt tgcctgggtt ggttggggga 
gtggcccagg ctgcagccgc ctttgggcca 
agggagcaac cgtgagccga gcccagaggc 
gacagcgttc cctcccccgc gtgcctagcc 
ccccgccctg ccccagggaa gcctgggctt 
gcgcccgcag gcctggtctg ctggggccgc 
cttgggcctc ctggaaggag gtggccaccc 
ccgcagctcc ctcagcttgt cctgagtccc 
agaaacagaa gattctattt tttacagcga 
tttcagatgt aatttttatc tttgctccga 
ggcggcatgg gctggtttca cccccttcac 
ggggtgcttt ggggggagct gcgccgatca 
gtggagcgga gcccccgtgg ctctggactg 
gcggcctttc ccgaatgggg acagaacccg 
gactggctct cttggtgcac caggggaggg 
cctgaggcaa cacagggtgg gcactgaccc 
cgcagacaat ggccacacct ctctccccag 
gcctcggggc caatgagcgc ctcttcctag 
gaggggtccc tggggcagat ggggctttac 
ttgtggagaa agggagatga aaactgacca 
gggccacatt cggggagcgg ggggtcgggg 
caggcccctc cctggaagtc ctcgggagcg 
ccgtggagag aagctgtagt ttttaccaaa 
tgtgactaat cactgaacta gacgaatgtt 
tttggatctg taaataatca ttgccagtgt 
attaagaacc gatgaaaaaa attctcctgt 
aaagaaaaag tattgtataa attataattt 
taaggctcag ggctgtctgc ctccttggca 
agctgggcca ccaaccccag gccctcagtg 
cacggctgcc ggcggcgggg cgtttgggta 
gaggtggggc tgagaagcag agtcctgttg 
gcagggcctg agttccagaa gacaggaggc 
ggtacccagg gctggcacag ggcgaagtgg 
gtcagggggc atgcaggctt tggcccctct 
ctggacacag gggtttccag caacacttgg 
cacagaaggg tgcaaggacc tgcctggctt 
tgccaggtgc acctggtgct gcctagcctc 
gaggcagaga cctccccagg ttacacagca 
tagtgggcca cgccacccta aggggcacag 
tcgacatagc cagccactga tccatccatt 
cgtccatcca tccatccatc catccatcca 
cattcatccc tccatccatt catccatcca 
cacctattca tccacccatc catccatcca 
catccatcca tccacccacc catccatcca 
tacccatcca tccatccatc catccatcca 
cacccaccca cctatccatc tacctaccca 
cacccattta cccacccacc cacccaccca 
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32821 gccagccaag tccttggtgc tgggacacag cagtgatggg ctaggaacct gccaaacaca 
32881 ccctcacatc cttcccagga acacatccat gctgtgatag ccaccgaccc cttccaccca 
32941 cccctctgtc ctctgcaaca ggtcctcccc cagcttgctc ctaacttgga acaaggctca 
33001 gacctaccta acccgttggc agcccacact gacccctcca cacagacagg gggcagtgtg 
33061 ggcccatggc aggtgtcact ggagccgggc catggaggag gcggaggggc ctcccagggg 
33121 cttctgagtg ctcaaaaagg ccaaccttgt gggagaccac ccaccctcgg cccacaggct 
33181 ctatgcctaa accaacaact tgcctgtttt gcgtatattt caccatttag tgtcaataaa 
33241 ggtgcctaca tatacctcta tcaccattta cgggtgattg tgataactat ttcagtgcag 
33301 ctggtttcct tggcaatcgt gcatgtttta atgcatgcat tttaccttac tcagcggcat 
33361 ctctcggctg tacctggcta ccagaggggc tgtgcaacaa ggaaaggtca caaacctgtg 
33421 gggcaggcct ggtgccctct aggacttttg ttctgggagt gatggggcac tgcagagctt 
33481 ccagcagaag ctggcactct ggctgcaaat cagaagaggg gtcacccagg gtgggggtgc 
33541 gtcagggaga gtccagaggg gaatgaaatt gttaatgatt ttattcttga accaggggtt 
33601 ctcttcatag gtacttgcta tatgttttta aatatttact ggatgaatgt aacggatgaa 
33661 tgagtaaata aatcagagag ggcctcacat ggaccaaagt gagtgacgct gggaacccag 
33721 ggttgtggtc cattctgctc gtcctgcaga actccaggag gcagcattct atgggcttat 
33781 gagctccagc agtgtttgga ggacagggtt ggccagagac tacaggagcc acagactaag 
33841 ggcacagcca aggaaggacc ccaagacccc gtcctgtgtt ggtgtgagct gggaaagccg 
33901 cccagggaaa gggctgtgca gccacccagt gctgctgaaa tcagaaatct gccctcaaaa 
33961 cccctggaac caggaagccc agctctgccc tcctcctggg gccctcgggt ccacgtcagc 
34 021 tggaagggag acacctggga tcacagaggg gcagctgggg tggggctgga gccaagaggc 
34081 agtgatgggc agctgaccac tccctcagac aggagctggc ccatgcccct gccatcctct 
34141 ctgctcaact ctctaccaca tgtcacagct ggcatcgcaa gctaggggaa atagctggac 
34201 catgcacgat tgtaaagaag acattcagct gttactatct gtaaatgata tgatgacaga 
34261 aaaccccgaa attcacaggt aaatcatcat cataggagtg tggaagctgc tggtacaaaa 
34321 tgagtcaggg gaggccgtga agaagctctc gaagacctgc ctgtagcaga actcacccca 
34381 gaaccctcgg gactgcgacg agggcgcccg cccagtgacg gccactcctg cggtgaacag 
34441 tactgcctcc tgccaggagc tgctgtgacc tgtgaacttc gttctagagc agcttacatg 
34501 gacttctttg ccttttaaag tgttgccttt atcgcgaaca ccccccaaaa tgccttccag 
34561 tgtagcacgc atatcctgac tgcaatcccc ttgctgttcc tgaataaact ctttcttagg 
34621 agaacactct gatgtcactt taggttgacg ggaggcaagg tttctggata taacccaaca 
34681 cataaaaaac aattgtggcg ggcagatcac gaggccagga gtttgagacc agcctggaca 
34741 acatggggaa accccatctc tactaaagat acaaaaaact agctgggcat ggtggcatgc 
34801 acctataatg ctagctactt gggaggctga ggcaggagaa tcacttgaac cccggaggca 
34861 gaggttgcag tgagccgaga tcacaccatt gtactccagc ctgggcaaca gggcgagact 
34921 ccgtctcata acaaacaaaa aattgcattt ctatgcaaaa caaaacaaat tttaagatag 
34981 cctcaaaggc caagtgtggt ggctcacgcc tgtaatccca gcgctttggg aggccaagat 
35041 gggaggttca cttgaggcca ggagtttgag accaggctgg gcaacatggt gagaccctat 
35101 ctttacaaaa cattaaaaag cctcaaaata agaacttttg gctgggcaca gtagcttact 
35161 cctgtaatcc cagcagtttg gaaggctgag gcaggaggat tgcttgagcc cgggagttcg 
35221 agaccagcct gggtaacaga gtgagacccc tgtctctaca aaaaatttta acaaaatagc 
35281 caggcatggt gttgcacgcc tgtagtccca gttactcagg aggctgaggt gggattgctt 
35341 cagtctgaga gattgaagtt acagtgagcc gtgatcacgc cactgtaccc cagcctgagc 
35401 aactgaataa gaccttgttt aaaacaaggt taaaaacaaa caaaagtcaa gttttaggaa 
35461 taaatctaaa aatttaaaga ctttcatgga aaaaattata aaataattaa aagacatgaa 
35521 tgaggatctt tcatggacca gaaaagtcag tttcacaatg caaatgtcct ttcgaagttc 
35581 ctcaaataca aatgcaaatg ccctctcaaa catgtggagt gggccgggcg cagtggctca 
35641 cgcctgtaat cccagcactt tgggaggtcg aggctggtgg atcacgaggc caggagctcg 
35701 agaccatcct ggctaacaca gtgaaacccc gtctctacta aatatacaaa aaattagctg 
35761 ggcgtggtgg cgggcacctg tagtaccagc tactccggag gctgaggcag gagaatggcg 
35821 tgaacccagg aggcgagctt gcagtgagcc aagattgtag tgagccgaga ttgcgccact 
35881 gcactccagc ctgggtgaca gagcgaaact ccgtctcaaa ataaataaat aaataaaaat 
35941 agacatgtgg agtgaaagaa gcaagttaca gaaaacaaca cgtttccttt ggataaggag 
36001 caaaacctgc aaaactaaac tcgctgttgc agatacagcc atggacagag gcaggaaggg 
36061 atgcgcttgg cgtgacctcg tagcgccctc ccagggggag cggggggact ccagagcagg 
36121 gcacagaagg cttcggggcg gcgttcagtt gcctggcata ggtggtggtg acaccttgct 
36181 tcattttatt cttgttcttt aaattgtgtg tgttttcatg aaaaatattc tgctttgaaa 
36241 ataacaggta tgtgtaaagg aaagaaagca agccaagtct gcccaagtaa ctttattcgt 
36301 gtcttctcaa acatttcaga tcaaataatg ccattttcta atcttgaaac cagcaagatt 
36361 agaaagagaa tgaggccatc ctttcactca gactttccca gttactcaag attgtattca 
36421 ggaacgaagt caaactcggc ctcacttctt aattagaata aactagaatc aggtccattg 
364 81 cataactgct aattgttttg caccagggag ccacaaagga agctaaaact agcctttcta 
36541 agttatccta agtttttact atctaaccag cacttctgat tttttttccc ttaaaactct 
36601 aaagtacagg tgttgtatgt ctgccgcatc tgaaaggctg ccccggaaat cagagcctat 
36661 ctagcgggaa tgtctgggca gtcaccccca tcctgcgcgc agatagcaaa gggctgctgg 
36721 caggtgtcag acacacggta ccgcaacagc agtcccgagg tctgagattc tgaatgtcag 
36781 caggtggctg gagagtatca ccagagccac tcaggaacca gccatgcgga atggctgggt 
36841 ttcttattga acatgaaatg cttcctcttt ggcctggact gttttgctaa aattgttagt 
36901 gggttttggg gcaggccagt gtatgggaaa cttttttttt tgttgggggg ggcgatggag 
36961 tctccctctg tttgcccagg ctggagtgca gtggcgcaat cttggctcac cgcaacctcc 
37021 acctctttct tgggttcaag cgattctcct gcctcagcat ccctagtagc tgagattaca 
37081 ggcgtgcgcc aacaagactg gctaagttgt gtatttttag tagagacgag gtttcaccat 
37141 gttgggcagg ctgatctcaa actcctgacc ttaggtgatc cacccgcctc ggcctcccaa 
37201 agtgctggga ttacaggcgt gagccaccgc gcccagccgg gaaacattta ccagggacaa 
37261 taaaggctta gaagaacctt ttggggaaac tttgtgagga gcacaattcc agagggccca 
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37321 cagcagccac agcctcctgg cgtgacacca 
37381 cacaggcagg cacagccagt ctggaagagc 
37441 accaggagca aacattggta gaaagagcca 
37501 tcattcatta ttgaaccatt tggcttgcac 
37561 ccaccattaa agctgccctt tcacctgatc 
37621 ttcctgtatg cttcaaaata taaatcaatg 
37681 tcaaacatga aatcactaag ggagcttctg 
37741 gtctacagcg tgctccaaac ggccacggcc 
37801 atattctcat atttcaaaca aaactacgtt 
37861 cactaaacaa ggacgtagta cacaattgag 
37921 ttccgggaag aatcctctgg tcgccggagc 
37981 tcgtggcttc gaggcaaaag acagacatcc 
38041 aggacaagat gttcaataaa tatagagaac 
38101 ggaccacgtg tacagacttc aggccctgcc 
38161 gcccctcctc cacggatggc cagcttcctc 
38221 cacttcgcgt cgatataaag cattggtatt 
38281 gctttacaca ctgccagctt ccagagctta 
38341 tcatgctccc agccggattg tcgtagcctc 
38401 cagcagcctt cccggcgctg gcgtggaggc 
38461 cgttgcattg cagttgcaca gtattgtaag 
38521 aagaggaagg tgaaatgaag gcaaagagaa 
38581 cgttacagaa taagtcctgc aaagtggatc 
38641 tggtgtgacc tgcgtggaaa tcatataatg 
38701 cgggccattt tctaaactac actgttgaca 
38761 agtgagcagc tgcatccgct gagagtaagc 
38821 gaactgaaaa tgccaaatat gtacacaaaa 
38881 aagatttaac tatcaaaaat aaatttctac 
38941 tgtaataatt atcttaaata gttttcacac 
39001 ctcagctgtg tgagtctggc ttcggattct 
3 9061 gctggcagtg gactctgcag agacaagcgg 
3 9121 aaagctggcc tgagccccac tcctggacag 
39181 ccaaagctgg ccctgagcac cgctcttgga 
3 9241 ggaacggtca gacgtcaccg tgggctgtag 
39301 gtggaataat tcacttagac tcccgggatg 
39361 ggtgggaaat ctgggcatcg gtagacctga 
39421 agcgtccgct ccagtgtctg cccactccac 
39481 actgagcgcg caaggcctgc gaggagtctg 
39541 tgtgaggaat gctcagatga gaaggacagg 
39601 tgaggaacaa gttgggggtc cctggaggtc 
39661 gtccgtgggt gagtgccatc ttcccacaca 
39721 atatcacagc ctcctcagac gtatttcagt 

3 9781 ggggcagggc acagtgactc acgcctataa 
39841 gatcacctga ggtcaggagt ttgagaccag 
39901 ctaaaaatat aaaaattagc caggtgtggt 
39961 caggagaatt gcttgaacct gggaggcaga 
40021 actccagcct gggcaacaag agcaaaactc 

4 0081 taagtgacag gctgtttcca tggagctgag 
4 0141 gcgacgtgct gctcaaggca tagccatatt 
40201 tgcctgtcct gattttcaaa ttattctatt 
40261 aacaatccca cttcccaaac ttgtcaaatg 
40321 tgcctggggt ggccggtgtt cctgggacct 
4 0381 tgacagaaag aaagggggcg cgtggccctg 
4044 1 taggcttgtt cggatgcgaa tgggagaagc 
4 0501 agtcactgaa agtggggcgc tgtccacaga 
40561 taaggcaccc aagggccatc tgacacccag 
40621 accagcctgc tatcgctctc acgccagcca 
40681 ccttgaagta gaccgagctc ttaccaacat 
40741 taccgttccc cagtctcatg taatttgcac 
40801 aaatcaactt gcttgttact gggtctttcc 
40861 tccatctggt tatgtgatac atcacagact 
40921 ggaggacacc acttcccacc ctccgcccag 
40981 cagggagacc gaacatccgt gtgtggccac 
41041 ggcctccagc acggggcgtc cctctggggc 
41101 cttcccagcc ctgctgcctc ctgcccctcc 
41161 ctgtggccct tgaggccagc ggttccctgg 
41221 aggcggtttc ctttgtttgc cttttgaaat 
41281 aaggctggag tgcagtggca tggtctcggc 
4134 1 cgactctcct gcctcagcct ccggggtagc 
4 1401 actaattttt tttgtatttt tagtagagac 
4 1461 tgaactcctg atctcaagtg atctgcccgc 
41521 agaagagaga gagaatgcca gggcaaagag 
41581 aagaggaaaa gaatgaaatg gaggagcttc 
41641 aatgctgagg actcggcacc ctggcatctt 
4 1701 atatttgaca tagggagaga aaagtgcatt 
4 1761 aagccccagg gaacccccca ccctgggtgg 



ggcacactgc cggctgcaca ggttgctggg 
agcccgtctt aaatctttgg tttctttcca 
gggcatattt actctttaca cccagaattt 
agtgctctct gataatgtgg tccataaaaa 
tgggtggggc taaacgagga aacgagaagt 
ccaggaaatt ctgaaaatgg atgctgcctg 
caaaacccac ccggcagcat gaggaacggt 
ccgctcctta cagatcaaag tttaagccgc 
acagacatca tctacgtggg cacggataca 
gggtgccccg gggagcgcgg caggaatgcc 
cctcgcgctg cagaggaaac cagcgatcca 
gagacgcagt ttacactact ctggcttaaa 
tcagtgtgaa gaaaagacaa ggacaaatct 
tcagaggagt cctgcgggag tccaggaccg 
ctgactgccc ccaagcccgg ctgtgttaag 
tgacatttta acagtaacaa acttgccagt 
tcatccgtgc tcccagaaca gacgccaagg 
gccgccgtga tgaggcaggt gtggaggccc 
ccctcccgcg gatcacctga agcagggcca 
gatcaagcag cgatgtccta accgtaaacg 
tcaaccactt cccaaaactt ggccacaaac 
aaagtgaccc actgctacct caggcccaga 
tccctaacat tattgcaata gagccactaa 
tcgttaaaac ctaagaggcc agtgccccca 
ggcgtgggct gcaacccgca gatgcgtaca 
ttgttgacaa aagggttttc ttctaaaatc 

atttaaatgt atatatcaac ataattggtc .-^ 

tttcccagtt tgtgagcagg acgccggctt V,;' 

cttcaattcc tcgtgcaaag agtcgaacca 

ccacgtgtaa accccacgtg tccctcactc 

gggctgcagt aaaccccacg tgtccctcac 

caggggctga agtgggggtc catcccccat 

gagtccagcc cacaatgacg ggccctgggc 

ccgagtttgt aaaggggttg ccaatggcca 

ctcgaactcc agcaccccca tggaaagcag 

ccacccaccc acctccattt tctcagcagt 

tgcggggcga ctcaatgcgt gtaagagatg 

tggcccaacg ccagcaagag ccttcgggct 

ccctgcccag gggctcggtg gccacagtga 

ggacagctct atttatctca tttattaatg 

aacctaaacc ccttcattaa gtgacaggct 

tcccagcact ttgggaggtc caggcgggcg 

cctggccaac atggtgaaac cccgtctcta 

ggcgggcgcc tgctacttgg gaggctgagg 

ggttgcagtg agccgagacc tcgccattgc 

tgtctcaaaa aataaaaaaa tagaaataaa 

ctcacacaca cacaaattat atttttaaaa 

tatcaagata gacaggacgt atttcaagaa 

tctttaataa atacatacac caatattaaa 

agttgacaga ggaaagtccc tgtgcacggc 

ttcgacagcc tcttgccatc accaccgccc 

ggcctgggga ggggcagaga cctcctccct 

tttcttagag ccgcagctgg ctggctgagg 

gatgcgtgca ' tcctcttaca tgctgtgact 

ctttcgttgc agacaggcgt gggcctgctg 

cactctgtta acgagcgtca agctgggctt 

ctgccaagag ttcagaatcc acctgagctc 

gcccacacct aattcagtgg ccttttgaaa 

atggagaaac aattttcttt gcactcagat 

tccaaataca atgggcccca gtgggtgtgg 

ccagccagag gaggggtgtg gcacctgccc 

acctgcctcc ccgcgtcccc taggctgcgt 

tcctcctctc cttcccatcc tcggggtgcc 

ctgcacggag cacctgcact tggtgattga 

tggattcagt ggacaaagga ctgcctgtaa 

ttttttgaga tggagtctcg ctctgtcacc 

tcactgcaac ttctgcctcc tgggttcaag 

tgggattaca ggcatgcacc atcacacccg 

ggggtttcac catgttggcg aggctggtct 

ctcagcctcc caccttttga attatttagg 

cccaatgcca gcctggagca cagggtgtgg 

aaacacccat gaaaattatc tgcaaaacaa 

tccagaatgt gggtaacggc atctcattcg 

tataaaacat tcttacgttt tgtgtttata 

ggcagcttgt gacagacacc tcagggctca 
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41821 aggggatggc gctggtggct ggaggacccc tgcacgcacc tgacctcgat ggcagggctg 
41881 tgcaggacct ccccgtcgca gttccaggag ctgttggaga cggtgcagca gcaggagggg 
41941 tggctgctgc aaatgtgccc aaagcgcttc ttccccccct ccttgaggtc gctgtcctca 
42001 tcctccatgt gcttcgacgt aaactggaat ttcttgacgc gataaactto "caaaagtg 
42061 aagtcaaact accaagaaac agtgagaggg accattcagc tctaaacgtc atcgtctggg 
42121 tggggcagca gaacacccca ggcctgacag cgagcgctgt caggcctggg tgcacacacg 
42181 gccc?tctcg gaggatgcga ctgcttgtgc aggcccctgg agttccc.ttt ccagctgaac 
42241 Igacagctca gagatgaagg gaaacctggc acccaccttt cccggcgccc accttccccg 
42301 gcgccLcct tccccggtgc ccaccttccg cggcacttgt caaaatgact 'ctaatgctt 
42361 ttagaaacca cagcttatct acaatgacga acgcttttta cagaattctg "aaataatt 
42421 agcagtgatt attagaattt tgtattctct ttttaacaag cactacttaa aaagccacat 
42481 aaaaatagca ttagactaga tcttaaatag aacttaagta aaaaaagaaa tgtgactaaa 
42541 tttggagact gcgaccaaaa gttccgcttc ccccaacatg actagactcg ggtcttcctc 
«60l tgcafaggct acagaaagat gtggtagaag ggtccgaaca gaacccattt ctgggagcct 
42661 aaagagtcta ttttcagtgg cctgcaaggg ccacttacat ctttactgtg "gctttgtt 
42721 ctgtttttaa ataggaagag aaacaacggg taagaagttg tattataaag ^"catacct 
42781 cttagtttct gtgctcaaat acacgtgtgt gtatatgtat gtatgtatgt gtgtgtatat 
42841 gtatgtatgt gtgtatatgt atgtatgtgt atgtatatac acatatatgt ^atatacac 
42901 atacacacac atgtatacat acatacacac acacacacac acacacacac acatatattt 
42961 gagatggagt ctcactctgt cacccaggct ggagtgcagt tgcgtgatct "gctcactg 
43021 tagcctccac ctcctgggtt caagcgattc tcatgcttca gcctcccgag tagctgggat 
43081 tacaggcgca cgccaccacg cctggctttt ttgtattttt agtagagatg aggtttcacc 
43141 atgttgccca ggctggtctc aaactcctga cctcgggtga tccgcctgcc "agectccc 
43201 aaagtgctgg gattacaggc atgagccacc gcatctggcc tcaaaaattt atcgttttaa 
43261 atg.t.ttl? acaaggagta gttgtaacca ggagtttaaa gtttgtgaaa "aaaaaaaa 
43321 aaaagtttgc ccttccttcc cttgaattaa ataatgaatt cctacaacac ataaaccagg 
43381 gactgctgga acataaatta ctcgccagtg gaggctccat gcaagagcac cccacttacc 
43441 tggtcctgct ggttggtgtg cctgatgaga aatctcagaa aattgaacct ggagcatttc 
43501 cggatgagga tgaggtcaga agacccgtct cccaagtggg cagccgggga gaggcccctg 
43561 gggctccggc gacaagcaca ggacatgttt gtggcattga tggccagaaa cttcccacag 
43621 acgacttgcc actcctccac gtcctctgaa gcacaaagaa ccacggagat gtcacagtta 
43681 caatggcgcc gggcagcgcc ctgcaccagg acggtgggag gccattcggg ctctcacctg 
43741 ctcafgcatt tlgcagacac ttcatgcaat cgactcttca gctcaacacc aaaaacatcc 
43801 accacggaac tgtccgctcc ccgcaggaag tggtacccaa aggatcacct cccagaaggc 
43861 aggactgttg tccctctcct cacttggtga cttttggaca atttagaaca agggtcagca 
4 3 921 aaccaaggct gtgtgtcaga tccggcccgc cgcccgtctg tgtgtggcct S^gagctaag 
43981 aatgcttttc acgtttttaa atccttgaaa aaggtctggc gcggtggctc acgcctgtaa 
44041 tcccagcact ttgggaggcc gaggcgggca gatcacctga ggtcaggagt tcgagaccag 
44101 cctggccaac atggtgaaaa ccatctctac taaaaataca aaaattagcc aggcgtggcg 
44161 gtgcatgcct gtaatcccag ctgctgggga agctgaggca ggagaactgc ttgaacccgg 
44221 aaggcagagg ttgcagtgag ctgagattgc gccattgcac tccagactgg gcaacaagag 
44281 taaaactctg tcttaaaaaa aaaaaaaaaa aaaaaaaaaa ttagccaggt atggcggcag 
44341 gcgcctgtaa tcccagctac tcaggaggct aaggaaggag aatcgcttga acccagaagg 
44401 tggaggttgc agtgagccga gatcgcgcca ctgcaatcca gcctgggtga tggaacaaga 
44461 ctctgtccca ataaataaat aaatattatt gactgggcat ggtggctcag acctgtggtc 
44 521 ccagcacttt aggaggctga ggcagtcaga tcacctgagg tcaggagttt gagaccagcc 
44 581 tggccaacat ggtgaaaccc catctttact aaaaatacaa agattagctg ggcatggtgg 
44641 cacatgcctg tagtcccagc tactagggag gctgagggag gagaatcgct tgaactcggg 
44701 aggcggaggt tgcagtgagc tgatattgtg cccctgcact ctagcctggg ^"^agt 
44761 gaaactccat ccaaaaaaaa aaaaaaaaaa aagaagagaa agagaaaaaa atcaaaataa 
44821 taatattttg tgacatgtgg aaattctatg aaacttaaat ttcagtgtcc ataaataaag 
44881 tttcatcggc agatggccat acccactcac tgtacactgc ctaggctgcc tctgctggac 
44941 agagactgtg tggttgccag ggcctgaggt ggtcaccaag caagccctca atgggacatc 
45001 tgctgccccc acattctaga gcacctggga caggagtcca gactcagcac cacaacagcg 
45061 tttcagaaag ggtaacgggg cctttcacaa aacccactgc tccacacagc "oacacggc 
45121 aagtaacagt gacotgacgc tgcctggggc taagaagaaa tggatgcctg gtgccagaca 
45181 ggaaaaagga aaagcacagg tgggcaggga gaagaggcag tgccctgaga gaggacagaa 
45241 acccgcactc cagcacacac agtgacaaca ctgagcctgt gttttaggaa ttacccgcag 
45301 cttccaaacc atacagtgct ttcttctgct cctcctccag ctgctgcttg ctttgcctgc 
45361 aaacaaagca tctgaaagac aagaatatca tcacctttta aatatggagc tgcaggtgca 
45421 gtgcgtatgc ttggcacata tagatgtatt taeactcttt tcacatacga aaaaattcca 
45481 tttcaaaggc ttaacaaaat attttttggc atattaatta gaggtttcag aatgagacct 
45541 acttttacaa tgacacattc aacatgtcct tctctaaaag gtttggacat gaggagaagc 
45601 acagcatgag gacgctgagg tgccacctgg agacggtttc tgcctggccc "cagctaac 
4 5661 atagcctggc tgggagaaat gactgaggca ctggttcccc cagcggactc ctggggtgag 
4 5721 gactcattta ctttcagaaa acaaatcata cagggtttat ctgttacagt caataacaag 
45781 caaataactc agaaccatca gcttagaact tagtcattcg tctagcacgt ccttctgatc 
45841 ttgggggctg ccggattcac gctgtcaaac aggacactgt ttgcactgat ggaaatcgct 
45901 catcagcagc cgagttcttt aatgcactga cgcaaacagc atctgcctct acatattctg 
4 5961 attttatggt ctccaattct tcacttaaaa aaggaaagga gctagaagga ggttctgaag 
4 6021 tctgcatgac ttaaaaggaa aggacgtgag tgtccttcac aaaggcgcag ctgtctccac 
46081 agccccgtgt cctcagcccc aggcagagca gagacaggat gaccattcag gcgccctcca 
46141 tggtcactag ggaggtcatg cggcctggga caactatgtt tagatttatt taaggttcta 
46201 tttacactgg tcgcatatta ctgaatacac tgtgtggggg tgggggcgga gatatgccag 
4 6261 gtgaaaggag tccctgcttg. ttcagtggcg tagccaccat tgcagagcct gcagtggcca 
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aactggggga ccaaatgtaa ggggagagcg cagcgtttcc agctgtgttg tctgcactca 
gcctcccctc cccggtgggg cgcctctttc cctttgcatc ccctctccct gggcagctgc 
cctggctctg ccccccaacc cctggcacca gcccttgttg ccagcccgag ccttcttgct 
gagggtccct gccttcctgt ggggttgaag gcgtggctgg cctgggctct gctaacctta 
gagtgggctc ctgccttgga gcatggggct aacatgggtg agggtctcag ccaggggtgt 
ggacaggagg gtggctgctg agagtgcaca cttcagccag ctctaccaac ggtcctgcca 
ccagtgccct gcacacgcac agtcactgat aaggtgtctg ccatcctttc ctgcctgcat 
ttctcctcca tcaggcccct gcagggggta aagggaaccc cagcagggaa agaccctgtg 
ctcagccccc actccaacag tgggtgtggg accctccccc agagtcctca tccctggcac 
agcagcagct gcctgcgcag gccatgggga cagagatggg gacagcaagc cccatgcacg 
cctaggccat catgggcatc ataggcatcg tgtgtctggg cacgctacat ccggcacctt 
ccaccacatt tggaaaatac ttctgcattc accttcctaa ccgtcctgct tcatttcccg 
tcatttgcaa gagaaacaca caaagcaatg cacttaccct gcccggcagg gcttcctatc 
ccttggagat cccaccgtgt gttgtgcagg gaggaaggac actgtccctt catagcagtg 
gtgggagagg aaggtcttta aacctgggaa cagagtgcag tgaagaaaaa acatccacaa 
gggtcattgc ttggttagga tgctgtgctg ggcagcttca gaaatgtcgc tgagcgcgca 
tctgcctaaa ccgtctgcag gagggagagg ccgggcctgc ttcaggccat gtcaccattg 
aaggcagtgg cagggtctct gccctccctc aggacaacag cacgtctagt catagtgctg 
gtgccttctg acacccagtg atgctgggac agccagggct atggccactc ttgacgctgg 
caaagtgggg cctttgccat tagccacccc tcatagcctc gatgatgtgg cctgtgggct 
ccagtggctc agggtcatcc cctccacctc ttcatgaggg tcagtgatgc ccccacctgc 
ctcagggctt ggcccccact ggggtcagcc acagcccctg tgacgtcctg aacagctcag 
gacagaggcc agggccgctg ccacccaaag tctggggcca ccctccaggc aaacgtgtgc 
gctgtccacg gggctgccag cagcctggca cccgggctgt ctgggaaaca ccacccagca 
ggacagcagg gactgggcag gaggcaggag aggggcccca gggtgtcgtg ggccacacca 
ggagccaggc cgtgtcctgt gagtctcagg gcgacttcac tcctccgtgt ctgatgctgg 
ggctggtgct gccctgtatg acatctggtg tggcagtgac ggcctgcagg gccgtcacac 
acaaccttac ccccgacaca cagaggcctc acatcctcat ccgcagatgg caagcccaca 
gccaagtgtg caccgcccag gggtggggga agctgccggc aagagcagtc tggctcctag 
gacatctgtg tgcctgaaag tcagctcccg actgtcccta ggacagcccc gtctgtgggt 
cttgcacctg gcatgggcca ggtgcctgga atgcactcag cgggcgggcc cttccaggtg 
gctgcagctc cgcctccctg cagagtggct ctggcctctc tccctgctcc tcccatgggg 
gtcccacaag gacgtcgtgt ccacggccct gggaggaggc cgacccactg gtcagcaacc 
ttccaagccg gctcctctct caagtgcagg gggcaggggg gcagctgcct catcacccag 
acgcagcctc cagggtggcg cctcaccgac atctgatgtc ccggtttctg tctgccccta 
gatgggcccc agatgtccta ccggcaccta aaactccacg tgtctcaaca gagtgtgtca 
tttccacacg ctcattctcc ttccatgccc tctgtgctgc tccagaacaa ccgggtgctt 
tctgcctcac agccacagaa cccaaacccg ctttcctgag agcatctcag gccctgtggc 
caggatgggg atccagagac ccggaaggcg tctcctgcct cggggctctc atcctggtgg 
gagggacaga gctgcagcca ctgcctgtgg tgatgggctg agggcaaagg ggggggccag 
gcatgcaggg gggctgccca ccaccccacc ccagcaggca gggacaggga gggggcatcc 
caggggcatt ctaggcagag ccacactggg ctcaacaaag ggcagtcggg agaatggcac 
aacccaaacc gcttcctaaa attctcgcgg cttctctaat tttctcggta ttccctttct 
taataagcct tagataaata cctcactttg ccatttctta ggttttaatt ttcaagccac 
actagttttt aatttcatta aatcttttca agcttcgtaa agtttgccaa atcactattt 
ctttctctca taagggttaa gattttagta tcaggaatta ctcgatttaa tttgatacag 
atttatctga aagcgaactg taaagaggcg ggaaagtcga ccgggcagcg agaatggcac 
accccatggt ctacatattt tcttctttct ttcttttctt ttttttttga gacagggtct 
ggctgtcgct ccagctgcag tgcagtggcc tgatctctgc tcactgcagc ctccaccttc 
tgtgctcaag ggctcctcct gcctcagcat cccgagtaac tggggtgccc ggctaatttt 
gtttttaatt tctgtggaga tggggtctcc ctatgtcgcc caggctgaaa ttctactttt 
cttttttttt gagacagagg tctcgctctg ttgcccaggc tgaagtgcag tggcacgatc 
tcagctcact gcagcctctg cctcccgggt tcaagcaatt ctcctgcctc agccccccga 
ggagctgtga ctataggtgt gcaccaccat gcctgactta ttttgtattt ttagtagaga 
cagggtttca ccatgttggc gaggctggtc ttgaactcct gacctcaggt gatccacctg 
ccttgacctc ccaaagtgct gggattacag gcgtgagcca ctgcccctgg ccaattctat 
gtatctttaa agttctcctt ttccagcgtt taaaagctgg tgacgtcgca cccactgcct 
gatgtgcgca gactgcaact cttgtgtctg aactggcacc agccttcatt tctggttcat 
tctaaactct gtggtcgggt aacaggtcac cgccgcctgg cccgaggccc cagcagcagc 
cccgggagcc tggaggacct gcatgttccg aagcccaccc agacctgccg tgttggatgc 
tctggggggt ccccgcagcc tgggttctac caagccctcc agaggatgca ggtgcccacc 
acagtctgag gaccactggg ctcaagaaag agtgaggcag gatgcgggca acctctttgt 
gcctgttttc tccgctataa agtggggcag gagcacctgc tggggtgaga agagcactgt 
ggagaatgaa agcttttggg cttcagggag tgcggcgtgg cattcgctcc cactcctcct 
cctctgtgcc cactgcggcc gagccgcagg aaggcaggat cttgcctgtt ttgttcatgg 
acacgtccct gcctcaaaaa ccaggcccgg catagagcag gggctcaaca aatacgtgct 
gagtgtatct cacttgtatg gctagctacg gtgtaacccc aacaaaacca cgcaaccttg 
ttgaatgaag taaaggatac ctgtgcctga agttacagaa gagcttccac acagatatcc 
caggctgaat ttctgcacaa agttcttgcc actttgacta ataagctcag agtgattata 
aaactattgt tttacaggcc aggcatggtt gctcatggtt gtaatcccag cactttggga 
ggccgaggtg ggcagattgc ttgagctcag gagtttgaga ccagcctggg caacagggca 
aaaccctctg tctacaaaaa tatacaaaaa aaaaaaaaat tagctggtca cggtggcatg 
tgcctgtagt cccagctact cgggaggctg aggtgggaga atcacttgag cctgggaggt 
caaggctgca gtgaaccgtg atcgtgctac tgcactccag cctgagtgac ggaggaagac 
tgtgtctcaa aaaaattttt aaaaaatata ttttaaaaac tatttttttt ccaattaata 
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50821 cttccacttg aacagtttgg ggtggcctct 
50881 tggcctggcg tcttggggtt ccagcacctg 
50941 tgtctactct gtggcttatg cttgggtcca 
51001 gttccagtcc agaccctgcc acgttctggc 
51061 ccgtctgtga gcaggtgggg gcccacctct 
51121 atctgtgaag gcagtc'gggg cacgaccagc 
51181 ctgaaaagtc gtatctggca agacccaacc 
51241 ccccgtagaa gccgtagccc agcagggaca 
513 01 ggactgagg'a cacatccatg gccagcgagt 
51361 ggcagccccc ctccagcccc gccccatcca 
51421 gtgggagtta cttttaaaaa gtgtgaaaca 
51481 ctcttggggt gttggatgcc atatttttta 
51541 aaaacaggct cggggagcta tcctgtgggg 
51601 gaaatatctt taactgaaag gacggtggtt 
51661 aaaatacaaa atcaagccga gtgcagtggt 
51721 ccgaggcggg cagatcacct gaggtcagga 
51781 accttgtctc tactaaacat ataaaaatta 
51841 cagctagtgg cgaggctcag gcaggagaat 
51901 gagccgagat cataccactg tactccagcc 
51961 aaaaaaaata aaataggcca ggtgcggtgg 
52021 ggaggctgag gtgggtggat cacctgaggt 
52081 gtgaaacccc gtctctacta aaaatacaaa 
52141 atcccagcta cttgggaggc tgaggcagga 
52201 cagtgagcca agatcgcgcc actgtactct 
52261 caaaataaaa taaagtaaaa tagaataaaa 
52321 cagtggctca cgcctataat cccagcactt 
52381 cccaggagtt tgagacacag ggagaacctg 
52441 tgtagtggtg cacttctgca atctcagcta 
52501 agttctggag gtcgaggcta cagtgagtca 
52561 acagagtggg accctgtctc aaagaaaaaa 
52621 tgtaatccca acactttgga aggccgaggc 
52681 accagcctga ccaatacggt gaaaccccat 
52741 tggtggcgcg agcctgtaat cccagctact 
52801 cttgggaggt ggaagttgcg gtgagccaag 

52 861 actgggaaac tccatctcaa aaaaaaaaaa 
52921 gtttctgtga cttcaggtga ggcacggtaa 
52981 ggcacaaaag atctgtcctg ccagagccca 
53041 gctcttcctg accggggctc tgcatcctgg 
53101 cgttccacag ctggcagccc aggcacagct 
53161 gggacgcggg tggtttccga gcctcggggc 
53221 gtcaggtgag ctgagtacgg gcatcgcgca 
53281 ccacactggg cgtgccccca ggaagcccgt 
53341 gagtgggatc acagcgcagc gagcagagca 
53401 ggacggcaac ggctggaaca cgcgacgggg 
53461 aggcccggct gccaccgtgc gcatgcgcag 

53 521 gccgaggttt ctgcgtcgct ggtgcccacg 
53581 gtaaagggaa aaagaaggtc ccaaatagca 
53641 attcagtgag tggtacctca ttcctttcct 
53701 aaatagaatt tacctacaaa atgaatatag 
53761 tgagattggg ccgggcggtg gctcacgcct 
53821 ggcggatgac ttgaggtcag gagttcgaga 
53881 tctactaaaa atacaaaaat tagccaggcg 

53 941 caggaggcta aagcaggaga atcgcttgat 

54 001 atcgcgccac tatccagcct gggcaacaga 
54061 aaggagccaa aaagttaaaa aaatatatat 
54121 gtgtgtgtgt gtgtgaacat ttacaattat 
54181 gagacggagt cttgctctgt cgcccaggct 
54241 caacctctgt ctcccaggtt caggcaattc 
54301 aggcacgcgc caccacgccc tggcttactt 
54361 atgatggcca ggctggtctc taactgttga 
54421 agtgctggga ttaggcgtga gccaccacac 
54481 atgcggcaaa ggtcacacac agccattact 
54541 ggcagctcag gtccctgctt cagcccgatg 
54601 tgcatctttt tatcctgcgc ttcactggct 
54661 ctgatcgcac ggccttccct ctgaggacca 
54721 accacccata cctgtgggag tagctgtgct 
54781 cgggagttcc tcatccaccc ggccagcact 
54 841 ctcccgttca cgcgagaagg acggggctcc 
54901 tctccttgag cctctctgtg cctaagtcat 
54 961 agcagatgca gttacaggct aacacccgat 
55021 cctatcaatt tgctaatgca tctatactaa 
55081 acatgctaca tcaggtctca aagcagccgg 
55141 ctgtcccgga ggcagccatc cacgggaggc 
55201 aacccccacc tgcccctcct gcctctcctg 
55261 tgcaccttct agcggctgga accctggagc 



gtcacctgcc taccagatgt gagaggctgg 
gggttctgga gtgaactgcg tccgtcatgg 
gggggcagga tctggtttag agcaatccag 
cccggcacat ttcatgaggc ccttagcttc 
gacggtgctc agtggattca gtcaggtaac 
ggggagcgag tctgcattat tctgagttac 
accgtttctt ctcactgtcc ttgatgatgt 
cggagtagcg aaggagtgtg ctgttgtggt 
cccctgtggg agagaacggc cgtgagggaa 
ctccatcccc ttcaatagca ctttgcaaac 
aacagaacgc attcagtgca aggattctct 
ttcaaatgac tgaactccac tcaaatttct 
actgaacaag agtgtatccg tgaggctcat 
ttagaatgaa gaaaaagtac atggcatttt 
tcacgcctgc aatcccagca cgtggggagg 
atttgagacc aggctggcca gaatggtgaa 
gccaggcatg gtggcgtgtg cctgaaatcc 
cgcttgaacc cgggaggcgg aggttgcagt 
tgggccacac acagagtgag actctgtttc 
cccgcgcctg taagtaaccc cagcactttg 
caggagttcg agaccagcct ggtcaacatg 
atgagctggg cgtggtggca ggtgcctgta 
gaatcgcttg aacccgggag gcggaggttg 
agcctgggca acaagagcga aactctgtct 
taaaatgcaa aatcagaagt aggccgggca 
tgggagggca aggtgagagg atcacttgag 
tctctacaaa aaacacaaaa attagccagg 
cttagaagac tgaggtggga ggatcgcttg 
agatcatgcc actgcactcc agcctgggtg 
atatcctgac caggtgctgt ggcttacgcc 
gggtggatca cctgaggtca ggagttcgag 
ccctactaaa agtacaaaat gaactggacg 
ctagaggctg acgcaggaga atcgcttgaa 
atagcgccac tgcactccag cctggggaca 
aaatcctaag tggctatttt aaaatataca 
ccctgtctac tggggtgagg aacaggcgaa 
ccgtgctgag tgccatgggc agccccttct 
actgaagcct cgaggcctca cagaggcatg 
gcccgtgccc actgccacag gcagtatggc 
agcagcctct ggaggaggaa ctgagcaaga 
tgtggtggat tccgtgggaa agaactcggc 
cctgacaacg tggcctcgga gctgcgtgct 
ggggacacag cgtgacccac ggccaggaca 
accagagtgg gacaggcctg ggggcgcagg 
aggaggggct caccaacaac gatatgcagc 
gtggagtaac acacgcagtc cgttgaccct 
tcagaataac agaattgtaa tcgcctccca 
tccatggccc tagagtcaac actaaggaat 
aaaatactag cgttgattcc tcaagaatat 
gtcatcccag cactttgaga ggccgaggtg 
ccagcctggc caacatggtg aaactccgac 
tggtggcatg cacctataat accagctcct 
cccgggaggc aaaggctgca gtgagtcgag 
gtgagacttc gtctcaaaaa aaaaaaaaaa 
atgtgtgtgt gtgtgtgtgt gtgtgtgtgt 
aggcataaca aaattttttt ttttttttcc 
ggagtgcggt ggcgtgatct tggctcactg 
tgcctcagcc tcccgaggag ctgggattac 
ttgtattctt agtagaggca gggtttcacc 
cctcgtgatc cgcccgcctt ggcctcccaa 
ctggccctcg taaacttctg aaacagctgg 
tgagatctta ataccaggac agccaggaca 
ttccccgtga acggggccgg ctgcggctta 
cagatgaaag ggaagctgca gctgccccgg 
tgctcaaaag aatgatttgc atccacaatg 
ggttctcaac attccccaca caactgaata 
gccgccaccc gagtaacgac caaccacatg 
aggggggttg ctcccaggac atgcgcatca 
tagcaggcaa tgctggctga agttacaagc 
gaaggaccga atgaatgacg atctcttctt 
tacatatgtg gaagagactt ggaaggtaaa 
agccatgcct ttcagatgct ccagtgacat 
agctcaacac cctccagatc tcaggacccc 
cccccagcca gcaagacaag cagtgacctc 
atgagtgagt tcaggtacct gctgccacca 
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55321 ccggccgtgc gcatccaggc ggcctcccac agggtctgtg gctgtcaggt ctcactcaca 

55381 actcttgcgc ccccaggctg ctccccaggt tgccatggtg accgttacag ggccaccctt 

55441 gggtgtctcc ctcaccctta ctcctccact caaggcgatg ctgcggagca gagactccac 

55501 acagcccccc accagctaca ccccggagtc tcagacccca tgcagcgccc ttaccgtcca 

55561 agccaccagc gttctcaccg gggcccacat gctcttagga ccaaacgcag agagcgatat 

55621 cggagcagag tcagacgtca ccccctgccg aaccaccgcc tccaagacct cctgctgcat 

55681 ggcagagtgg aactcggcat ctccctcggg ccctgccctg cccctccacc cccaccccca 

55741 gccacactgg cttccttgtc ccgaacaccc ccagctcctc ccacctgctg ccctctgcac 

55801 cagctgactc ccggcctagg cccctcacct gtcacctctc agaaaggcct cctggacacc ( 

55861 catggaaagg ggccacaaag agaccctctc cccagaccct cccccagtgg catttgctac 

55921 aatgcgcacc caccctggga aaggctttta tgcccacatt tggtggggtg cctgctgtcc 

55981 tctcctgggg cagaggccac agccaaggcc tagaggaggg cagggcacac agcggtaccc 

56041 ccaccctgac cacggggcag ggcacacagt ggcacctcca ccctgaccat ggtgaagtgc 

56101 agcctgcatc cccacgagca ctcctgagcc gcaggtctca gctcctgagg tcgttgtgga 

56161 ttttaagagc tgaggatttt agggcttagc actgccactg gcaaatggtg acaaaagaca 

56221 ggtttcttgc aaacaaggta agaaccatac agcatagatg agaatgcaga aaaacagaat 

56281 tcatggaatc cagaacgcac cacatacaaa gttaaaagac caaaacctaa tttggaaaag 

56341 agttgcaaga cacaaaaacc aatgaagggc aaataacgta taaacaatgt tatggtcaat 

56401 tagaaagtga caggccgggc atggtggctc acgtctataa tcccagcact ttgggaggcc 

564 61 gaggtgggca gatcacgagg tcaggagttc aagaccagcc tggccaacat ggtgaaaccc 

56521 tgtctctact aaaaatacaa aaattagccg ggcacggtgg tgggtgcctg taatcccagc 

56581 tactagggag gctaaggcag gagaattgct tgaacccagg aggcggaggt tgcagtgagc 

56641 tgagatcatg ccactgcact ccagcctggg tgacagagca agactccatc tcaaaaaaaa 

56701 aaaaaaaaaa aagtgacaat gccctcttgc ttccccatga gaacatggac taagcaaggg 

56761 aaaggaagtt tacaaaagca cacaatgcaa acagcaagag gacctgtgaa aagcagtgtg 

56821 gcctcacaca taatgaaaga aatacaaata aagagagtaa ataaagatag tgaataaggc 

56881 agcatactca gtctatcaga tgggataaaa gttaaaccac agacaaacct ccaggctgtg ^ , 

56941 ccaggctcgg ggaaactccc cctcccagcc ccgtgggtgg cctttatggg agaattaagt 

57001 gggcccagag actccacgtc aagggatgtc cacaaggaaa tcatgacctg gagcctctga 3 

57061 gagccacacg tgcacagggg ccagggtcct tcacgagtgt ggaagatgca gagagagtat ^ 

57121 ggaaagaagg atgggagtga ggggcctggg gtgaagacag caggagacgg tgccttgatc g 

57181 tcaggcactt gtggccatag ggagacagtg caaggccatc aacattggtt gtctgggtca j. 

57241 ttcaaagaat cggaacgggc tgggtccagt ggctcatgcc tgtaatccca gcaccttggg 

573 01 aggccaaggc aggtagatca cttgaggtca agagtttgag accaccctgg ccaacgtagc 

57361 aaaaccctgt ttctactaaa aatacaaaat tagccaggca tggtggtaca cacctgtagt 

57421 ctcagctgct caggaggctg aggcaggaga attgcttaag cccaggaggc agaggttgca y | 

57481 gtgagccaag atcgcaccac tgcactccag cctggacgac aaagtgagac tctgattcaa 

57541 acaaaccaac caaccaaccc aaagaactgg aacgcctctt gaaccagcgc cagccagcaa 

57601 gacaagtggt gggtgactgg ccaggcatgg tggtgcacgc ctgtaatccc agcactttgg 3 

57661 gaggcagagg ctggtggatc acttgagtcc aggagtttga gaccagcctg ggcaacatgg '* 

57721 tgaaactcca tctctacaaa aatatgaaaa aaaagttggc cgagtgtggt ggtgcacgcc 

57781 tgtggtccca gctacttggg aggctgaggc aagaggatca cttgagcctg gggcagtcta 

57841 ggctgcagtg agccacgatc atgccactgc actccaaact aggtgacaga gtgagaccca 

57901 gtctcaaaca aaaagacaag ccgtgatctg tgcaccttca ggggctggga gcctggagcg 

57961 tgtgagtgtg ctgagtgtgt gtgaacacag agctcgacct tgcacagctg tgctcagtca 

58021 ttcaacagat atctaagagg gagtttgggt gccagggctg ggtgccccac cgaggacaca 

580 81 tgagtgaaag ccaacctccc tgcctgtggg gaagacagac atcagccaag gcccaggggc 

58141 cctcccgagc cgtctgctcc cggccagctc ttccgctctc cccactccta actgcctggc 

582 01 aggaaccgag aggtcggcgg cacacaggag gcctgagaag gatgtgagga aatccagcag 

58261 aggctccccc gagatccagc gggatgttgc tataatgctt tggtttatct gagagacttg 

58321 ttattggaag gggtgctgag agctttttct ctattgagct atttctaaag aaccatttct 

58381 ccccttagtg tctcccagcc tggagccagg ctgttctcat tagacaggac tcttctcagg 

58441 tgagtgacat tcttctaaaa ctcatcaggc gatattatga aaagttcctc tctattaata 

58501 actgcatcac gccttcccta aaagccaaag ggaagggatc gtaaccagac gatcatcctt 

58561 tcctatatac ttaaaaccat gaccatggtc aacttgttgc taacaattat tagtaattca 

58621 cgagacatta tgctaacaca aatcccccta aaatgagtct tgatactgaa aacacccgca 

58681 atgtcaccat ccgtggggaa catttgcctc acaccattat tcacaagctc tgtcattaag 

58741 aaggagtaat gacttaagtt taaaggatga ctccctcaag taaacactgg cccacaatgt 

58801 cacctgtcgg aggcaggtta gagttgagag cggcacccac agagaaggtg gctcagggcc 

58861 tgtccctcgt gagggctagc ctttcccttg ctttgaggca aggtggcctc taaacctcca 

58921 actctcgctc ccatccggtg acctgtgacc tcacccccag tcctcctgtc ttcatccata 

58981 aaatggggat gactgcggtg ctcacatccc agagctgtga cgagagtgaa agtgtgcacc 

59041 atgccgccac cctcggggcc agacacaggg ccagagagga cagacacctc gtcccctgcc 

59101 aggtcatttg caaagcctgc aacacaccac gctcaggccc tcagcttgac ttagcttaac 

59161 actgtgactt ttcaaagaac aaggtggttc ctagaaggat gttgaatgga ccaagccgaa 

59221 ccagcacttc aggacacagc ccagctgttc ccgggctcgg ctaggtcctg cccttggttc 

59281 cgcccccacc tttcccactc agtctcctcc caactcccca gcccgattcc tgcccacacc 

59341 cacccctaac tcccggttcc ttcaggtcca gaagccactt cttccagaac gcaccccctg 

594 01 ccaacctacc cacctgctcc cgggcaccgg gggttcccca ctccaggact gcacccccag 

59461 cggtggggtt ctcctgcctt gcgtgaagca gcctttgggg gagtcgggca actttccttt 

59521 ccacgagtgt ctatttcatt ggggtcatac tctcaatgat atccccccaa gagtggccct 

59581 aatgttagga aatctgtacg aactaaactg agtataattg ggttattaag tgtccgaggt 

59641 ggctacttgc gcagaacaat tcctaaggac gggaacgaag agaacagaga atgccaggcc 

59701 ggctccatac ctgcgggaat gattccaatc cggaggctac tggggaccag cacagcccgg 

59761 gggtggttct ggtcgacccc ggcgctcctc tgcgtcctcc caatcagacc gtgcagcacc 
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59821 tcgctgaaca taccatctcc gccgacacag acgatgctgc aggaggaaca cagccggtca 
59B81 gggctcctgc aggtgcggcc ctctgagcgc agcaggcctg aggcttccag cttcaggagg 
59 41 ggcagcctgc aclcctggcc cttggcaggc atctagaagt gcggctcctg "atgtccct 
60001 aagtgataag gtcgtgcctc tgtgccagtt ggcctagacc acgtgtacaa ataatgagat 
60061 tgltggcaaa tactggcttt ccttctgtga gtctggaatt tgggtggeta 9gcagagaat 
60121 gcctgcatgc ctgtaccccc tcaaaaacac tacactctgg gtctcaggac ".agtttgc 
60181 ggggcgggaa gagtacacct gtgctgctgc atttgtggga gcagagaggg tgcactccag 
glgacccgca ctgcgggaga ggcacagcag tctgcgcggg 9acgtttetc «tcgce..g 
60301 ctggctgtgc agccttgctc tgtcccgtca gaaatctcag ccg«gagcac 9actctagct 
60361 caggxagcL atcaccagcc gtgtgggtgg tcttgaggac ccaaggcaag ^cagagggg 
60421 acggaataaa tgcggacaaa atggggcatg agacaatcta aggagagg-g • t S9agaaag 
60481 ggglcagtgg ctgaagggag aaaggacggg ttggtttcag ggtgtgctgt cctatgccgg 
605 1 glgtaacccl calggaacc! acactgaggt ggcagaatga gaggtgcagg catggagaca 
606oi gacatgtcac tgcagcccag agggaaggca gaggacaggg ctcaggtttc ctggacatct 
6066 «"agcct. cggclgccca agagctctca tcacccggga agagagggtt 9cagggggaa 
60721 ggtccgccgg tagaggagcg gtgtgagtcg actagggggt gcagcctgag 999cctccct 
60781 cLgcltgfc agcatcaagg tgcctgccca ttccccagtg ggtctgtctg ^aggcgcg 
60841 accagtctcc agggctggga tctctgccca ccacccacca ctcaggccac ctgctttggg 
60901 cccggctccc atgggaaggc tctttgccct tccccgtago ctacaccttg caccggggca 
g«ffccc.g cggggctgtg gtttcgagga cagggcatcc ttgaatgggc agtgtagq.t 
61021 ctgtgtgtta gacagacaga tcactcagac agacagctca gccgcaggaa gagcagccct 
elosi cactgglcag gcctlccagc ctcctgcacg gcatgtccct gggtttgggg agaaggcgtg 
SilA ccctcaagag gcgaggctga tcccttggct agggtgggtg caataaggtg ccagccccat 
61201 ccctgctgal Lgtgaagac gactgtcttc ccgccaagct gctgtcgggt ggaggtggtg 
61261 ggaagaagta catggaggag ctgccatcgg gtggagttgg ggagatgtgc atggaggagc 
61321 cccctgcftc ctggagccca tgaacagaac actcccgcat taggagggga gcaaaggggt 
ctccSgtc. gcacctgggg gcaggggctc cttttatttc cctttctaac cttagcacc. 
61441 qcagagtgct gggacctgga ggaccctcaa taatgtttgt ggaatgaata atgatttcaa 
61501 faaltltccg cfgcaat^a taacacattg ctcaccgact ccagatggaa cagagagaag 
61561 aaaacaaaaa gccaaccaat gacaaggcaa ttaactacag gggaatagct "«"aaga 
61621 tgaagagtga aattttaagc ataaagaaca gcacagaaga gctggacaga cttgagtaca 
tgaatttgaa aacttctact gggaagaatt aatgccataa gcaaaattaa aagacaatct 
61741 caaaattagc aaaaatattt gcaaaaatat gaatataaat atgaattgcc tattatttgt 
61801 tggataagag aacaaatgac tgaaattatg gccaacaaaa gaaaatgggc caaagacata 
1 aaLtcaltt ccaagaagcc agctagccaa taaacatatt tcttaaaata "agagactt 
61921 ccacatgtca aattagcaga gaaattttat actgataata ccaaacatgt tgacaaagac 
6198 agtaaaLat atactltalg aaagaatgta aactggccgg gcgcggtggc tc.tgcctgt 
62041 aatcccagc. ctttgggagg ctgaggcggg tcgcgtggat cacctgaggt «ggagttcg 
62101 agaccagcct ggccaacatg gcgaaatcct gtctctacta aaaacacaaa aattagctgg 
62161 gcgtggtgtt gggcacctat aatcccagct actcgggagg ttgaggcagg agaatcattt 
«22l gglcffggga gglcgaagct gccgaggtca cgccattgca ctccagcctg 99caacaaga 
62281 gcgaaacfct gcctcaaaaa cgaaaaaaaa aaaaaagaat gtaagttaat agaatctctt 
^cgaaagcaa Ltggctgta cacggcagag attttaaaat gttaatctcc "tgactaat 
624 01 ttcatttcta ggaatttacc ttaaaaatgt agatgtatgc agagagattt «3t.ca»ca 
62461 atqctatcct aagagcaaaa attggaaaga gcctgaacat ctacaagtga gaggctaaat 
1 atatggca tgcccacata atggaccatg atatgggcat taaaaatagt agttctggga 
62581 aatctctagg aattttgggg aaaagaagaa tcccaaactg taaaaacgac agcatctaaa 
6 6 1 "ttcttgl? aatgtgctlg gttgatttgc ttctctgtct acacagcacc gcataagcac 
62701 atcttctgaa agaccgaaag ggaacagaac cagcaaacaa ccattctcgc ccagcccagc 
6 761 cagagaggag tgtgtgactg cacttttctg ccttttcttt cttgccttcc tgtaagaagc 
62821 atgtttaltt ttacalttgg tcgtgggtgg aggcgggggg cggatttagg -aaggagct 
62881 ctcaaaactg agaggtgggt agaacttttt cattaacaat ggacttgatg 9^atgaaag 
62941 acggcttact cacccgtcgt atttgtctat gttaatctca tacagagtct ""ggcctg 
3001 attagcatgt tcagtaactg tggggaaaaa ttacatcaca "gtctatac tcacatctgt 
63061 gagtcctcac gtaaaaggca aatcatccat tcttttaaaa tgagttcctg taacaccttc 
6 121 aggcggattt cctagataaa attccctctt caattctaca tcgctcttat 999'cgggat 
caaaccaacc tcagggaacg tgaacatgaa cctctctcgg gccacagtct gcgcgatgac 
atgcagcgtt c"gg!at« alttgcclgc aaataataat tcctttgaa. gagattacgt 
63301 ctctKtata atttttcctt attatacaag ttacttccaa aattgtacac tataccaaag 
gtagtaaga agatgttagt gcagtctcta tacctgatgt aaacttctat tgaaagtccc 
63421 cccaccccac aacaggcccc ggtgtgtgat gtgagattat ttctaagact "ttaatcct 
634 81 ttgtgtctcc tcttttgaga atttectgtc cagaatcgtt gcccttttat "tcctagtg 
63541 tttttcttga ttgccttcta tgctgagaat tggctggaaa atactaagtt atttattttg 
63601 tattcttocc atLtggtat gtaaaaaaaa gttctatttt gtttaaaata atctcacagg 
63661 ataaaaatct tgggctgggc acagtggct.c atgcctataa tcccagcact "gggaggct 
63721 aaggtgggca acattgcttg agcccagggg ttcgaaacca gcctaagcaa "tggtgaaa 
63781 ccccgccLt acaaaaaata caaaaattag ccaggcatgg tggcacaagg tg^ggtgcca 
63841 gctacttggg aggctgaggc aggaggatca tctgagtcaa ggaggctgag 9«acagtga 
63 901 cccacaattg caccactgca ctccagcctg ggcaacagag tgagacaccg "tagaaaaa 
63961 gcagaaacgt ctctagtgac tatacttgaa tcattctcaa acttacttct acatagttaa 
"021 Ltlgaatlt gttaccgatg atgtcagtgg tgatggaggc taaggtgaac agtgg tgcc. 
64081 cttttctttc atatatccgc ttgccttgtc cttttcctcc aaacgggttg "aaatacca 
64141 gtaaatgctt tggtctggac gctgtaagac aacaacatgt aaataacaac 9«aataaca 
64201 Latcaaaac gaagagctct gttactggac aaaatcaaac catgagggtt "agaattct 
64261 gcttagtatt acctgatgga aagtattttt aaatct.taaa gaacaatgac aaagagcttc 
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64321 ctgatctatg aaaagagaaa atcaacaggt tgtccaactg ggcaaaggct atacacacat 
64381 gtcggacatc atgaaactcc cacgaactca gaattccgac tgagaagggc tctctctgct 
64441 ctgaagggtg ctgtcgggta cacatgctgg gggcagcagc ctcgttcttc tgcaggtgaa 
64 501 ggaggtgagt gtgcagacag caggaagaag gaagcagaca gagggaagag tcttaggaaa 
64 561 agcggaccag caggtatgga aagaagaagc cacacaggga gatgcttgca gagatggaaa 
64621 gacagaaacc agtgacctcg tttccatctc tccaggggcc ccgccaaact cccatgcctg 
64681 ggtttcctca gtcaccccca ttttcttttc ttttcttttt tttttttttt gagatggagt 
64 741 ctcgctctgt cgcccaggct ggagtgcagt ggtgtgatct cagctcactg caagctctgc 
64801 ctcctgggtt catgccatcc tcctgcctca gcctcctgag tagctgggac tacaggcgcc 
64861 cgccaccacg cccggctaat tttttgtatt gttagtagag atggggtttc accgtgttag 
64921 ccagagtcac ccccattttc ttataacaac ttccacattt ctgttttggt gggttcatgt 
64981 gactctgcta cttaaaagaa cacatttaac taacaaatca ataaaagtaa caagacagta 
65041 tctacccaag gaaactgaaa acctaagtcc atacaaaaat ccagacacag ggccaggcgc 
65101 ggtggctcat gcctgtaatc ccagcacttt gggaggccga ggtgggcgga tcacgaggtc 
65161 aggagaccgg gaccatcctg gctaacacgg tgaaatcccg tctctactaa aaaaaaatac 
65221 aaaaaaatta gccgggcgtg gtggcgggca cctgtagtcc cagctacttg ggaggctgag 
65281 gcaggagaat ggcatgaacc cgggaggcgg agctggcagt gagccaaggt cgcgccactg 
65341 cattccagcc tgggcgacag agtgagactt catctcaaaa aaaaaaaaaa aaaaaaacaa 
65401 acaaacaaaa aaacacagac acaaatgttc atcacagcat tcttcaccac agccaagcag 
65461 tgggaacagc ctgaacgccc atcacctgaa gaagaaatcg agacgcagtc cacacatcaa 
65521 cggagcatcc ttcagtcatg aaataagcaa actactgggt gcgaccaaca tgaatcaatg 
65581 tgaaaacagt atgcagagag actggaggct tccacccagg agcacacgta aaacttctgg 
65641 ggtgacagaa gccttgatgg gtttgagtta caaaggtatg tgtgtttcag aagtcattga 
65701 ttgaaactta ggattttggc caggcatagt ggctgacacc tgtaatccca acactttggg 
65761 aggccaaggc aggaggatca tatgagcaca ggagttcaag accagccagg gcgatatagt 
65821 gagactctgt ctcgacaaac gatcaaaaaa aatcaggcca ggctcagtgg ctcacgcctg 
65881 ttaatcccaa cactttggga ggccgaggtg ggcagatcac ttgaggtcag gtgtttgaaa 
65941 ccagcccggg caacatggtg aaaccctgtc tctactaaca atacaaaaat tagctgggaa 
6 6 001 tggtggcaag cacctgtaat tccagctgct caggaggctg aggcaggaga atcacttgaa 
66061 cccaggaagt ggaggttgca gtgagccaag atcgcgccat tgcactccag cctgggtgac 
66121 agagtaagac tccatgtcaa aataaacaaa caaacaaaca aattagctgg gtgtggtggt 
66181 atgcccttgt agtcccagct actctggagg ctgaggtgag agaatcactt gaacctggga 
66241 ggtcgaggct gcagtgagct' atggtcgtgc cactgcactc cagcctgggc tacagagtga 
66301 gacattactc cagattctac caatattaca aagaaaatga aagaatttac taataagttc 
66361 atgctaataa actcacgaat ttggatgaaa tggttaaact tctcaaaaag caaaatttgt 
66421 cagagatacc agaagaaata gaaaatctga atagtctgac* agctaataaa gacaccaaag 
66481 ctattaatga aaacttctca caaagaaaac tccaagccca gatgatttca caactgaatt 
66541 cttccaaaca tttaaggaaa ■ aaatagcctc aagattatag gaacgcttcc agagaacaga 
66601 aaaaaacaaa cacgtatgaa ctctttgtat gacacaccac tgtgaccttg ggaacaaaac 
66661 cagacaaaga tgttatcaga aaggggctca tattcagaca tccctcccct gaacctacgt 
66721 gcaaaaatcc taaacacaat atcggtaagt caaatccagt gacacataaa cagtactaca 
66781 tcacaaccaa gagaggtttg tttactccag gactgcaaca atgagaaatt aatgtaactc 
66841 aacacattaa ttgaaaaaaa aaaagaaagt catacagtca tctcaataga tgcagaaaag 
66901 gcatttatca aattaaatac ccattctcca cttaaaaaaa ccaaaactct cagaaaagta 
66961 ggattaaaag gaaattctgt cttaatctga tttttaaaat ctattaaaaa cataagaaac 
67021 atcattctta atggtgaaac actaaaatct ttcccctgag atcaggactg agacaaggat 
67081 gctgctagca cccctgtgtt ggtccaagcc agggcaaaaa cataaggaaa agaaataaaa 
67141 gagagaagaa ttaaaatgga agaaacacaa ctgttattat gcacagctga catgatcgtg 
67201 tttgacaaaa atctaaaata atcatcaaac tattagcatc cataagtgaa gtttgcaagg 
67261 tcattgaatg caaggtgaaa tacacaattc tatcacattt ctatacacaa acaataaaaa 
67321 ataaataaac agtgcagtag ggaaaacgag actttcccgt aaatgccaca gggtcaagca 
67381 gatgtttata gttaatgaaa atactaggtt gggcacagtg gttcgtgcct gtagtcccag 
67441 ctacttggga ggctgaggca ggaggatcgc ttgagcccac tagttcaagg ccacagtgag 
67501 ctatgatcgc atagctgcac tgcagcctgg gtgacagagc aagaccctgt cttaaaaaaa 
67561 tgaatacata aaaagaaaga ggaaaatgaa tcttgactgc tcacctcaca ctggacacaa 
67621 agatgaattc caaatggatt acatagatcc aaatgtgaag cggaaaacaa tagagcttgg 
67681 agaataaaac gtcttcgtga ctttggagta gacaaagata tcttcaatag gacacaagct 
67741 taccagaaag ttttttaaaa tcataaaaat aggccgagta tagtagctca ccctggtaat 
67801 atcaatgctt tgggaggctg aggtgggagg attctttgag cctaggagtt tgagaccagc 
67861 ctgggcaaca cagcaagacc ccagctctat gaaaaaattt tcaaaaaatc agccaggcac 
67921 ggtgacatgt gcctgtagtc tcagctactt gcgaggctga ggcagaagca ttgcttgagc 
67981 tcaggagttc aaagctgcag tgaaccgtga ttgtgccact gcactccggc ctgggtggca 
68041 gaacaagacc tcatctccct tttttttttt tttttaaaga tggagttttg ctctttcgcc 
68101 caggctggag tgaagttgca tgatctcgac tcactgcaac ctccaccccc agggttcaag 
68161 tgattctcct gcctcagcct ctcgagtagc tgggatgaca ggtgtgtgcc accatgcttg 
68221 gctaagtttt gtatttttag tagagatggg gtttcaccat gttggccagg ctggtcccga 
68281 actcttgacc tcaggtgatc cacccgcctc ggcctcccaa agtgctagga ttataggcgt 
68341 gagccaccat gcccggccaa gaccccgtct cttaaaaaaa aaaaatacga gattgaaaag 
68401 gcaaattaca gaattggtaa ttcttattaa ttacataaat ttatgtaatt aataaattca 
68461 ataaaagatt cagatccaca atatgtaaac agtttctaca catcattaag aaaaagatac 
68521 aaccagctgg gcatggtggc tcatgcctat aatcccagca ctttgggagg ctgaggcagg 
68581 tagatcactt gaggccaaga gttcaagatc agcctggcca atatagcaaa accccctctc 
68641 tactaaaaaa tacaaaaaaa ctagccaggc gtggtggcac acgcctgtag tcccagctac 
68701 tcaggaggct gaggattgct tgaacccagg aggcagaggt tgcagtgagc caggatcgcg 
68761 caactgccct ccagcctggg caacagagcg agactctgtc cccccaccaa aaaaaaaaga 
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SB821 aaaaaaqaat gaaagaaagg aaggaagaga aagataacaa ccaaacggaa aaatggacaa 
6 81 aagacttgaa cagaacctca caaaaggcga tatctgaatg gcccatgaac atatgaatgt 
68941 gtftctcLc ctcattaacc attagggagt gcaaactaca accacagtgc «gactacta 
IZ\ fcaccaccat ggctaagacg taaaggtgtg gctaagatgt ggggcaacca g a tec ga 

lilllllllflli 

69361 gcaagactac cogecccK 99tggaaaac ageg^aa -t -caag ^JJ 
69421 agecgagage "ggcacaaa areata c tgt g S catgaccaca 

s;s s:s s 4^ ™ 

mi §1 isszssmm 
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70 861 ggaaggtaga ggctgcagtg agctaagatc 
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□ 1 : CAB62977. dA59H 1 8.2 (novel ...[gi:6572330] 



Links 



LOCUS 

DEFINITION 

ACCESSION 

VERSION 

DBSOURCE 

KEYWORDS 

SOURCE 

ORGANISM 



REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



COMMENT 



FEATURES 

source 



Protein 



CDS 



CAB62977 326 aa linear PRI 07-J AN-2000 

dA59H18.2 (novel protein similar to human, mouse, yeast, worm and 
plant (predicted) proteins) [Homo sapiens] . 
CAB62977 

CAB62977.1 GI:6572330 

embl locus HSDA59H18, accession AL096766. 12 



Homo sapiens (human) 
Homo sa piens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 
1 (residues 1 to 326) 
Bates, K, 

Direct Submission 

Submitted (05 -OCT-1999) Sanger Centre, Hinxton, Cambridgeshire, 
CB10 ISA, UK. E -mail enquiries: humquery@sanger.ac.uk Clone 
requests : clonerequest@sanger .ac.uk 

This sequence has been finished according to sequence map criteria 
as follows. An attempt is made to resolve all sequencing problems, 
such as compressions and repeats, but not necessarily within known 
annotated human repeat sequence elements (e.g. Alu) . Where the 
sequence is ambiguous, there is an annotation using the 'unsure' 
feature key. 

This sequence was generated from part of bacterial clone contigs of 
human chromosome 22, constructed by the Sanger Centre Chromosome 22 
Mapping Group. Further information can be found at 
http: //w ww. San g er .ac.uk/HGP/Chr22 

During sequence assembly data is compared from overlapping clones. 
Where differences are found these are annotated as variations 
together with a note of the overlapping clone name. Note that the 
variation annotation may not be found in the sequence submission 
corresponding to the overlapping clone, as we submit sequences with 
only a small overlap as described above. 

The following abbreviations are used to associate primary accession 
numbers given in the feature table with their source databases: 
Em:, EMBL; Sw : , SWISSPROT; Tr : , TREMBL; Wp : , WORMPEP; Information 
on the WORMPEP database can be found at 

htt p://www.sanqer.ac.Uk/Projects/C eleqans/wormpe p IMPORTANT: This 
sequence is not the entire insert of clone 59H18. It may be shorter 
because we only sequence overlapping sections once, or longer 
because we arrange for a small overlap between neighbouring 
submissions . 

The true right end of clone 439F8 (AL021392) is at 100 in this 
sequence. 59H18 is from the library RPCI-6 constructed at the 
Roswell Park Cancer Institute by the group of Pieter de Jong. For 
further details see htt p : / /bacpac . med . buf f alo . edu/ VECTOR: pPAC4 . 

Location/Qualifiers 

1..326 

/organism= "Homo sapiens" 
/db_xref =" taxon: 9606" 
/chromosome= " 22 " 
/clone="59H18" 
/clone_lib="RPCI -6" 
1. .326 

/product="dA59H18 . 2 (novel protein similar to human, 
mouse, yeast, worm and plant (predicted) proteins)" 
1. .326 

/gene="dA59H18.2" 

/coded_by=" complement ( join (AL096766 . 12 : <43440 . .43645, 
AL096766. 12 =4 5295. . 45371 , AL096766 . 12 : 4 7078 . .4 7183, 
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□ 1: AI237625. EST234187 Normal i...[gi: 3 831 131] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name: 
GenBank Acc : 
GenBank gi: 



2010430 

EST234187 

AI237625 

3831131 



CLONE INFO 

Clone Id: 
Source : 
Id in host: 
DNA type : 



RPLDB60 
ATCC 
2044267 
CDNA 



(3- end) 



PRIMERS 

Sequencing : 
PolyA Tail : 



M13 -21 
Unknown 



SEQUENCE 



GTGG AT ATGT C AAC AT AATTG G T C AGTCG AAATTTTCTT AAGTAATATTCAC ACTT C CC A 
AAGCTCCTGGGCTTTGGGATTCTTCCTTAGGACTCTTCTTCGATCCCCCGAGCAAAGAGG 
CGCACCAGCTGGCAGTGGACCCTGACCTCAATGGCAGGGCTGTGCAGGACTTCCCCATCA 
CAGTTCCAGGAGCTTCTAGAGGTTGGGCAGGCGCAAGGGGGGTTGTCCTTGCAGATCTGC 
CCAAACTTCTGCTTCTCTAGTTCCTTCAAGTCATTGTCATCATCTTCCACGTGCTTTGAT 
GTGAACTGGAATTTCTTGACTCGATAAACTTCAACGAAAGTGAAGCCAAACTGATCCTCC 
TGGTTG GTGTG C CGG ATG AGG AATCTC AGG AAG TTG AAC CTGG AG C ATTTC CGG ATAAGG 
ATGAGGTCAGAAGACCCATCTCCCAGATGGGCAAATGGGGACAGGCCCCCAGGGCTCCGG 
GGACAAGCACAGGACATGTTGGTGGAGTTGATAGCCAGGAAC 



Entry Created: 
Last Updated: 



Nov 2 1998 
Jan 31 1999 



LIBRARY 

Lib Name: Normalized rat placenta, 

Organism: Rattus sp. 

Organ: placenta 

Vector: pT7T3Pac 

R . Site 1: EcoRI 

R. Site 2: NotI 



Bento Soares 



SUBMITTER 
Name : 

Institution : 
Address : 
Tel: 
Fax : 
E-mail : 



Lee, NH 

The Institute for Genomic Research 
9712, Medical Center Drive, Rockville, 
(301) -838-3529 
(301) -838-0208 
nhlee@tigr . org 



MD 20850, USA 



CITATIONS 

Title: 



Authors : 



Year: 
Status : 



Rat Genome Project: Generation of a Rat EST (REST) Catalog & 
Rat Gene Index 

Lee , N . H . , Glodek,A., Chandra, I., Mason, T.M., Quackenbush, J. , 

Ker lavage , A . R . , Adams , M . D . 

1998 

Unpublished 



MAP DATA 



http://www.ncbi. nlm.nih.gov/entrez/qu^ 1 1/19/2002 



NCBI Sequence Viewer 



Page 2 of 2 



Revised: July 5, 2002. 



Disclaimer I WritejoJheJ±elp_Desk 
NCBJ I NLM I Nib 



http://wwncbinlm.nih.gov/^ 



11/19/2002 



NCBI Sequence Viewer 



Page 1 of 2 



^ Nucleotide 



JUl'..'^. v. . 

Taxonomy 




Clipboard 



Details 



C 1: AI536375. mbUaOlyl Soares...[gi:4450510] 



Links 



IDENTIFIERS 



dbEST Id: 

EST name : 
GenBank Acc : 
GenBank gi : 



2352535 

mbl4a02 .yl 

AI536375 

4450510 



CLONE INFO 

Clone Id: 
Source : 
DNA type: 



IMAGE: 329354 {5') 
IMAGE Consortium, LLNL 
CDNA 



PRIMERS 

Sequencing : 
PolyA Tail: 



-40RP from Gibco 
no 



SEQUENCE 



TTT AT CG AG TC AAG AAATTC C ACTTC ACG TCG AAG C ACGTGG AAG ACG AGGAC AATG ACT 
CGAAGGAACAAGAGAAGCAGAAGTTTGGGAAGATCTGCAAGGACAGACCCTCTTGCACTT 
GCTCAGCCTCCAGAAGCTCCTGGAACTGCGATGGCGAAGTCATGCACAGCCCGGCCATTG 
AGGTCAGGGTCCACTGCCAGCTGGTGCGCCTCTTTGCTCGGGGAATCGAGGAAGAGTCAT 
AAGCAAGAACCCCAAAGCCCAGGAGCTGTCGGCCTTGAGCTCGGGGAGTGTGGAAATTAC 
TTAAGAAAAATTCGACAGACCAGTTATGTTGATATATCCATTTGAATTTAGAAATTTATT 
TTTGATAGGTAAATCTTGGTTTTTAGA 



Entry Created: 
Last Updated: 



Mar 18 1999 
Mar 15 2000 



COMMENTS 



This clone is available royalty -free through LLNL ; contact 
the IMAGE Consortium { inf o@ image , llnl . gov ) for further 
information . 

This read is a RESEQUENCE of a previously sequenced mouse 
clone 

This read has been verified (found to hit its original self 
in the correct orientation) 
Putative full length read 
vector to vector length is 388 
MGI : 2 10754 



LIBRARY 

Lib Name 
Organism 
Develop . 
Lab host 
Vector : 
R. Site 
R. Site 
Descript 



stage : 



1: 
2: 

ion: 



Soares mouse p3NMF19.5 

Mus musculus 

19.5 dpc total fetus 

DH10B (ampicillin resistant) 

pT7T3D (Pharmacia) with a modified polylinker 
Not I 
Eco RI 

1st strand cDNA was primed with a Not I - oligo(dT) primer 
( 5 1 TGTT AC C AATCTG AAGTGGG AG CGG CCG C ATTTTTTTTTTTTTTTTTTTT 3 ' ] , 
double -stranded cDNA was size selected, ligated to Eco RI 
adapters (Pharmacia) , digested with Not I and cloned into 
the Not I and Eco RI sites of a modified pT7T3 vector 
(Pharmacia) . Library went through one round of normalization 
to a Cot = 5. Library constructed by Bento Soares and 
M.Fatima Bonaldo. RNA was kindly provided by Dr. Minoru Ko 
(Wayne State University) . 



SUBMITTER 
Name : 



Marra M/WashU -NCI Mouse EST Project 1999 
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Institution: 

Address: 

Tel: 

Fax: 

E-mail : 

CITATIONS 

Title: 
Authors : 



Year: 
Status : 



Washington University Scho ^ 5 ^ di ^ ne Louis , M 0 63108, USA 
4444 Forest Park Parkway, Box 8501, fat. 

314 286 1800 
314 286 1810 

mpuse^st@watsoii^ 

The WashU -NCI Mouse EST Project 1999 

Mafra.M.. Hillier.L.. Kucaba,T., Martin, J. , Beck^C y 

,M., McCann,R., Waterston, R . , Wilson, R. 
1999 

Unpublished 
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